Commit Graph

  • 435ea1543c [HUDI-2793] Fixing deltastreamer checkpoint fetch/copy over (#4034) Sivabalan Narayanan 2021-11-24 18:26:40 -05:00
  • ff94d92980 [HUDI-2766] Cluster update strategy should not be fenced by write config (#4093) Sagar Sumit 2021-11-24 23:45:40 +05:30
  • 60b23b9797 [HUDI-2788] Fixing issues w/ Z-order Layout Optimization (#4026) Alexey Kudinkin 2021-11-24 10:10:28 -08:00
  • 973f78f5ca [HUDI-2443] Hudi KVComparator for all HFile writer usages (#3889) Manoj Govindassamy 2021-11-24 10:05:36 -08:00
  • 90f2ea2f12 [HUDI-2671] Fix kafka offset handling in Kafka Connect protocol (#4021) rmahindra123 2021-11-24 10:03:58 -08:00
  • 9af219b7c1 [HUDI-2688] Claim the next rfc 40 for Hudi connector for Trino (#4105) Sagar Sumit 2021-11-24 22:13:37 +05:30
  • a234833f0a [HUDI-2759] extract HoodieCatalogTable to coordinate spark catalog table and hoodie table (#3998) Yann Byron 2021-11-24 18:12:38 +08:00
  • 0bb506fa00 [HUDI-2847] Flink metadata table supports virtual keys (#4096) Danny Chan 2021-11-24 17:34:42 +08:00
  • 323be33f18 Revert "[HUDI-2799] Fix the classloader of flink write task (#4042)" (#4069) Danny Chan 2021-11-24 12:01:18 +08:00
  • 0cf2f103e0 [HUDI-2838] refresh table after drop partition (#4084) Yann Byron 2021-11-24 11:46:48 +08:00
  • 5078d29eb4 [HUDI-2818] Fix 2to3 upgrade when set hoodie.table.keygenerator.class (#4077) Raymond Xu 2021-11-23 19:43:34 -08:00
  • 18cf59507f [HUDI-2831] Securing usages of SimpleDateFormat to be thread-safe (#4073) Alexey Kudinkin 2021-11-23 17:25:11 -08:00
  • fbff0799b9 [HUDI-2325] Add hive sync support to kafka connect (#3660) rmahindra123 2021-11-23 15:48:06 -08:00
  • 969a5bf11e [MINOR] Fix typo,rename 'HooodieAvroDeserializer' to 'HoodieAvroDeserializer' (#4064) 董可伦 2021-11-23 19:10:57 +08:00
  • ca9bfa2a40 [HUDI-2332] Add clustering and compaction in Kafka Connect Sink (#3857) Y Ethan Guo 2021-11-23 00:53:28 -08:00
  • 9ed28b1570 [HUDI-2409] Using HBase shaded jars in Hudi presto bundle (#3623) zhangyue19921010 2021-11-23 13:55:12 +08:00
  • 9de9951348 [HUDI-2778] Optimize statistics collection related codes and add some docs for z-order add fix some bugs (#4013) xiarixiaoyao 2021-11-23 13:46:02 +08:00
  • c88c2af8bf [HUDI-2743] Assume path exists and defer fs.exists() in AbstractTableFileSystemView (#4002) Sagar Sumit 2021-11-23 08:43:10 +05:30
  • 6aa710eae0 [MINOR] Add more configuration to Kafka setup script (#3992) Y Ethan Guo 2021-11-22 18:03:38 -08:00
  • e22150fe15 [HUDI-1937] Rollback unfinished replace commit to allow updates (#3869) Sagar Sumit 2021-11-23 07:29:03 +05:30
  • 0d1e7ecdab [MINOR] Fix typo,'multipe' corrected to 'multiple' (#4068) Jimmy.Zhou 2021-11-23 09:20:23 +08:00
  • 772af935d5 [HUDI-2737] Use earliest instant by default for async compaction and clustering jobs (#3991) Y Ethan Guo 2021-11-22 17:19:41 -08:00
  • 3bdab01a49 [HUDI-2550] Expand File-Group candidates list for appending for MOR tables (#3986) Alexey Kudinkin 2021-11-22 16:19:59 -08:00
  • fe57e9beea [HUDI-2599] Make addFilesToview and fetchLatestBaseFiles public (#4066) Sagar Sumit 2021-11-22 22:53:50 +05:30
  • fc9ca6a07a [HUDI-2559] Converting commit timestamp format to millisecs (#4024) Sivabalan Narayanan 2021-11-22 11:44:38 -05:00
  • 89452063b4 [MINOR] Fix instant parsing in HoodieClusteringJob (#4071) Sagar Sumit 2021-11-22 19:27:44 +05:30
  • 7f3b89fad7 [HUDI-2472] Enabling metadata table for TestHoodieIndex test case (#4045) Manoj Govindassamy 2021-11-22 04:21:24 -08:00
  • a2c91a7a9b [HUDI-2533] New option for hoodieClusteringJob to check, rollback and re-execute the last failed clustering job (#3765) zhangyue19921010 2021-11-22 19:00:33 +08:00
  • 02f7ca2b05 [HUDI-1870] Add more Spark CI build tasks (#4022) Raymond Xu 2021-11-22 02:16:45 -08:00
  • 8281cbf762 [HUDI-2799] Fix the classloader of flink write task (#4042) Danny Chan 2021-11-22 11:05:05 +08:00
  • 2533a9cc17 [MINOR] Fix typos (#4053) 董可伦 2021-11-21 16:34:59 +08:00
  • 887787e8b9 [HUDI-1932] Update Hive sync timestamp when change detected (#3053) Nate Radtke 2021-11-21 00:41:05 -06:00
  • 520538b15d [HUDI-2392] Make flink parquet reader compatible with decimal BINARY encoding (#4057) Danny Chan 2021-11-21 13:27:18 +08:00
  • 0411f73c7d [HUDI-2804] Add option to skip compaction instants for streaming read (#4051) Danny Chan 2021-11-21 12:38:56 +08:00
  • 74b59a44ec [HUDI-2813] Claim RFC number for RFC for spark datasource V2 Integration (#4059) leesf 2021-11-21 10:59:12 +08:00
  • 305d160081 [MINOR] optimize in constructor of inputbatch class (#4040) dufeng1010 2021-11-21 10:11:01 +08:00
  • 1a5484d2db [MINOR] Claim RFC number for RFC for debezium source for deltastreamer (#4047) rmahindra123 2021-11-20 17:28:48 -08:00
  • ae0c67d9fc [HUDI-2795] Add mechanism to safely update,delete and recover table properties (#4038) vinoth chandar 2021-11-20 08:07:40 -08:00
  • f4b974ac7b [HUDI-2742] Added S3 object filter to support multiple S3EventsHoodieIncrSources single S3 meta table (#4025) Harsha Teja Kanna 2021-11-20 03:24:21 -06:00
  • 6cc97cc0c9 Remove the aws packages from hudi flink bundle jar (#4050) Ron 2021-11-20 11:55:12 +08:00
  • 3dc6262437 [HUDI-2242] Add configuration inference logic for few options (#3359) wenningd 2021-11-19 19:38:38 -08:00
  • 0230d40b74 [HUDI-2796] Metadata table support for Restore action to first commit (#4039) Manoj Govindassamy 2021-11-19 17:02:57 -08:00
  • c8617d9390 [HUDI-2472] Enabling metadata table for TestHoodieMergeOnReadTable and TestHoodieCompactor (#4023) Manoj Govindassamy 2021-11-19 17:02:21 -08:00
  • 459b34240b [HUDI-2593] Virtual keys support for metadata table (#3968) Manoj Govindassamy 2021-11-19 15:11:29 -08:00
  • eba354e922 [HUDI-2731] Make clustering work regardless of whether there are base… (#3970) Sagar Sumit 2021-11-19 21:39:08 +05:30
  • bf008762df [HUDI-2798] Fix flink query operation fields (#4041) Danny Chan 2021-11-19 23:39:37 +08:00
  • 7a00f867ae [HUDI-2791] Allows duplicate files for metadata commit (#4033) Danny Chan 2021-11-19 14:30:17 +08:00
  • 4e067ca581 [HUDI-2641] Avoid deleting all inflight commits heartbeats while rolling back failed writes (#3956) Udit Mehrotra 2021-11-18 05:33:50 -08:00
  • 24def0b30d [HUDI-2362] Add external config file support (#3416) wenningd 2021-11-18 01:59:26 -08:00
  • 8772cec4bd [HUDI-2790] Fix the changelog mode of HoodieTableSource (#4029) Danny Chan 2021-11-18 16:40:48 +08:00
  • 71a2ae0fd6 [HUDI-2789] Flink batch upsert for non partitioned table does not work (#4028) Danny Chan 2021-11-18 13:59:03 +08:00
  • 2d3f2a3275 [HUDI-2734] Setting default metadata enable as false for Java (#4003) Sivabalan Narayanan 2021-11-17 14:43:00 -05:00
  • f715cf607f [HUDI-2716] InLineFS support for S3FS logs (#3977) Manoj Govindassamy 2021-11-17 10:59:38 -08:00
  • 1ee12cfa6f [HUDI-2314] Add support for DynamoDb based lock provider (#3486) wenningd 2021-11-17 09:09:31 -08:00
  • 826414cff5 [MINOR] Add the Schema for GooseFS to StorageSchemes (#3982) 卢波 2021-11-17 22:47:52 +08:00
  • 4d884bdaa9 [MINOR] Fix typo,'Hooide' corrected to 'Hoodie' (#4007) 董可伦 2021-11-17 16:50:04 +08:00
  • aec5d11da2 Check --source-avro-schema-path parameter (#3987) 0x574C 2021-11-17 14:45:43 +08:00
  • ce7d233307 [HUDI-2151] Part3 Enabling marker based rollback as default rollback strategy (#3950) Sivabalan Narayanan 2021-11-17 01:21:28 -05:00
  • 04eb5fdc65 [HUDI-2753] Ensure list based rollback strategy is used for restore (#3983) Sivabalan Narayanan 2021-11-16 23:36:55 -05:00
  • cbcbec4d38 [MINOR] Fixed checkstyle config to be based off Maven root-dir (requires Maven >=3.3.1 to work properly); (#4009) Alexey Kudinkin 2021-11-16 18:30:16 -08:00
  • 6f5e661010 [HUDI-2769] Fix StreamerUtil#medianInstantTime for very near instant time (#4005) Danny Chan 2021-11-16 13:46:34 +08:00
  • bff8769ed4 [HUDI-2712] Fixing a bug with rollback of partially failed commit which has new partitions (#3947) Sivabalan Narayanan 2021-11-15 22:36:03 -05:00
  • 38b6934352 [HUDI-2683] Parallelize deleting archived hoodie commits (#3920) zhangyue19921010 2021-11-15 22:36:54 +08:00
  • 53d2d6ae24 [HUDI-2744] Fix parsing of metadadata table compaction timestamp when metrics are enabled (#3976) Sivabalan Narayanan 2021-11-15 07:27:35 -05:00
  • 3c4319729c [MINOR] Fix typo in IntervalTreeBasedGlobalIndexFileFilter (#3993) dufeng1010 2021-11-15 14:39:43 +08:00
  • a0dae41409 [HUDI-2758] remove redundant code in the hoodieRealtimeInputFormatUitls.getRealtimeSplits (#3994) xiarixiaoyao 2021-11-15 11:29:40 +08:00
  • a14d1040b9 [HUDI-2589] Claiming RFC-37 for Metadata based bloom index feature. (#3995) Manoj Govindassamy 2021-11-14 17:47:41 -08:00
  • 0bb6d8ff80 [HUDI-2706] refactor spark-sql to make consistent with DataFrame api (#3936) Yann Byron 2021-11-15 07:44:39 +08:00
  • c2f9094b49 [HUDI-2756] Fix flink parquet writer decimal type conversion (#3988) Danny Chan 2021-11-14 08:51:54 +08:00
  • 994922a159 [HUDI-2472] Enabling metadata table in TestHoodieIndex and TestMergeOnReadRollbackActionExecutor (#3978) Manoj Govindassamy 2021-11-13 16:37:30 -08:00
  • 0e8461e9ab [HUDI-2697] Minor changes about hbase index config. (#3927) xiarixiaoyao 2021-11-13 09:12:33 +08:00
  • 93fd3517e3 [HUDI-2741] Fixing instantiating metadata table config in HoodieFileIndex (#3974) Sivabalan Narayanan 2021-11-12 17:28:25 -05:00
  • 9720820975 [HUDI-2718] ExternalSpillableMap payload size re-estimation throws ArithmeticException (#3955) Manoj Govindassamy 2021-11-12 05:18:40 -08:00
  • 4f217fe718 [HUDI-2151] Part1 Setting default parallelism to 200 for some of write configs (#3948) Sivabalan Narayanan 2021-11-12 07:29:37 -05:00
  • bc511edc85 [HUDI-2746] Do not bootstrap for flink insert overwrite (#3980) Danny Chan 2021-11-12 12:17:58 +08:00
  • 6b93ccca9b [HUDI-2738] Remove the bucketAssignFunction useless context (#3972) yuzhaojing 2021-11-11 21:03:01 +08:00
  • 90529aa552 [HUDI-2495] Resolve inconsistent key generation for timestamp types by GenericRecord and Row (#3944) Yann Byron 2021-11-11 11:54:34 +08:00
  • 77b0440eb4 [HUDI-2634] Improved the metadata table bootstrap for very large tables. (#3873) Prashant Wason 2021-11-10 19:37:48 -08:00
  • 90f9b4562a [HUDI-2685] Support scheduling online compaction plan when there are no commit data (#3928) yuzhaojing 2021-11-11 10:13:21 +08:00
  • 2d362af00a [HUDI-2730] Move EventTimeAvroPayload into hudi-common module (#3959) yuzhaojing 2021-11-10 20:22:24 +08:00
  • 187bedf795 [HUDI-2442] Change default values for certin clustering configs (#3875) Sagar Sumit 2021-11-10 14:23:24 +05:30
  • a40ac62e0c [HUDI-2086]redo the logical of mor_incremental_view for hive (#3203) xiarixiaoyao 2021-11-10 15:41:07 +08:00
  • fd0f5df26d [HUDI-2297] Estimate available memory size for spillable map accurately. (#3455) Shawy Geng 2021-11-10 14:05:12 +08:00
  • bb6a19e7d7 [HUDI-1877] Support records staying in same fileId after clustering (#3833) Sagar Sumit 2021-11-10 09:47:50 +05:30
  • dfe3b84715 [HUDI-2579] Make deltastreamer checkpoint state merging more explicit (#3820) davehagman 2021-11-09 17:37:59 -05:00
  • 2f95967dfe [HUDI-2591] Bootstrap metadata table only if upgrade / downgrade is not required. (#3836) Prashant Wason 2021-11-09 07:26:20 -08:00
  • e057a10499 [HUDI-2715] The BitCaskDiskMap iterator may cause memory leak (#3951) Danny Chan 2021-11-09 15:40:00 +08:00
  • 6d109c6de5 [HUDI-2595] Fixing metadata table updates such that only regular writes from data table can trigger table services in metadata table (#3900) Sivabalan Narayanan 2021-11-08 22:12:32 -05:00
  • 7aaf47e716 [HUDI-2698] Remove the table source options validation (#3940) yuzhaojing 2021-11-08 16:56:03 +08:00
  • c7bf2c7687 [HUDI-2709] Add more options when initializing table (#3939) Danny Chan 2021-11-08 15:08:49 +08:00
  • cf2ecd77ba [HUDI-2679] Fix the TestMergeIntoLogOnlyTable typo. (#3918) Shawy Geng 2021-11-08 02:19:17 +08:00
  • e0285800fb HUDI-1827 : Add ORC support in Bootstrap Op (#3457) manasaks 2021-11-06 21:53:20 +05:30
  • f41539a9cb [HUDI-313] bugfix: NPE when select count start from a realtime table with Tez(#3630) Genmao Yu 2021-11-07 00:16:13 +08:00
  • 9a8963d05e [HUDI-2702] Set up keygen class explicit for write config for flink table upgrade (#3931) Danny Chan 2021-11-06 12:23:15 +08:00
  • 08c35a55b3 [HUDI-2526] Make spark.sql.parquet.writeLegacyFormat configurable (#3917) Sagar Sumit 2021-11-05 22:33:41 +05:30
  • 844346c3ab [HUDI-2471] Add support ignoring case in merge into (#3700) 董可伦 2021-11-05 22:50:16 +08:00
  • b7ee341e14 [HUDI-1794] Moved static COMMIT_FORMATTER to thread local variable as SimpleDateFormat is not thread safe. (#2819) Prashant Wason 2021-11-05 06:31:42 -07:00
  • 3af6568d31 [HUDI-2696] Remove the aborted checkpoint notification from coordinator (#3926) Danny Chan 2021-11-05 16:37:23 +08:00
  • f67da0c7d0 [HUDI-2686] Proccess record after all bootstrap operator ready (#3925) yuzhaojing 2021-11-05 14:36:22 +08:00
  • 2c1e259329 [HUDI-2651] Sync all the missing sql options for HoodieFlinkStreamer (#3903) yuzhaojing 2021-11-05 12:16:21 +08:00