Commit Graph

  • 98ec215079 [HUDI-3178] Fixing metadata table compaction so as to not include uncommitted data (#4530) Sivabalan Narayanan 2022-01-08 10:34:47 -05:00
  • 46bb00e4df [HUDI-3139] Shade htrace and parquet-avro in presto bundle (#4495) Sagar Sumit 2022-01-08 20:59:36 +05:30
  • 827549949c [HUDI-2909] Handle logical type in TimestampBasedKeyGenerator (#4203) Sagar Sumit 2022-01-08 20:52:44 +05:30
  • 03a83ffeb5 [HUDI-3195] optimize spark3 pom and modify build command (#4538) Yann Byron 2022-01-08 15:21:39 +08:00
  • 4f6cdd73a3 [HUDI-3192] Spark metastore schema evolution broken (#4533) 董可伦 2022-01-08 10:48:37 +08:00
  • 518488c633 [HUDI-3185] HoodieConfig#getBoolean should return false when default not set (#4536) Sagar Sumit 2022-01-08 02:50:11 +05:30
  • 2e561defe9 [HUDI-2947] Fixing checkpoint fetch in detlastreamer (#4485) Sivabalan Narayanan 2022-01-07 11:38:58 -05:00
  • b1df60672b [MINOR] fix typos in DDLExecutor (#4534) 董可伦 2022-01-07 20:59:55 +08:00
  • 76a72641f1 [HUDI-3188] Update quick start guide for Kafka Connect Sink for Hudi (#4527) Y Ethan Guo 2022-01-07 04:56:08 -08:00
  • 2467c137e4 [HUDI-3100] Add config for hive conditional sync (#4440) Raymond Xu 2022-01-06 23:26:35 -08:00
  • b2b23f5d3a [HUDI-3183] Wrong result of HoodieArchivedTimeline loadInstants with TimeRangeFilter (#4521) YueZhang 2022-01-07 10:16:29 +08:00
  • d7afc58d0c [HUDI-3118] Add default HUDI_DIR in setupKafka.sh (#4460) Thinking Chen 2022-01-07 07:46:51 +08:00
  • f0c2912d35 [MINOR] Remove unused methods in HoodieColumnProjectionUtils (#4408) xuzifu666 2022-01-07 07:36:13 +08:00
  • 8718c30324 [HUDI-3165] Enabling InProcessLockProvider for all multi-writer tests instead of FileSystemBasedLockProviderTestClass (#4427) Sivabalan Narayanan 2022-01-06 13:04:10 -05:00
  • 2954027b92 [HUDI-52] Enabling savepoint and restore for MOR table (#4507) Sivabalan Narayanan 2022-01-06 10:56:08 -05:00
  • b6891d253f [HUDI-44] Adding support to preserve commit metadata for compaction (#4428) Sivabalan Narayanan 2022-01-06 09:57:37 -05:00
  • 50fa5a6aa7 Update HiveIncrementalPuller to configure filesystem (#4431) hehexiaoduantui 2022-01-06 15:49:30 +08:00
  • 205e48f53f [HUDI-3132] Minor fixes for HoodieCatalog fengli 2021-12-31 23:25:04 +08:00
  • eee715b3ff [HUDI-3168] Fixing null schema with empty commit in incremental relation (#4513) Vinish Reddy 2022-01-05 22:13:10 +05:30
  • 75133f9942 [HUDI-3170] Do not preserve filename when preserveCommitMetadata enabled (#4512) Sagar Sumit 2022-01-05 18:39:58 +05:30
  • 0e297c0c4c [HUDI-3171] Sync empty table to hive metastore (#4511) Danny Chan 2022-01-05 16:41:33 +08:00
  • a66212d204 [HUDI-2966] Closing LogRecordScanner in compactor (#4478) Sivabalan Narayanan 2022-01-04 21:57:18 -05:00
  • 37b15ff458 [HUDI-3147] Add endpoint_url to dynamodb lock provider (#4500) Nicolas Paris 2022-01-04 22:42:28 +01:00
  • bf4e3d63e7 [HUDI-3141] Metadata merged log record reader - avoiding NullPointerException when records by keys (#4505) Manoj Govindassamy 2022-01-04 13:41:33 -08:00
  • aaf5727495 [HUDI-2774] Handle duplicate instants when fetching pending clustering plans (#4118) Sagar Sumit 2022-01-05 03:02:05 +05:30
  • 7329d229d5 Adding tests to validate different key generators (#4473) Sivabalan Narayanan 2022-01-04 00:18:04 -05:00
  • 29ab6fb9ad [HUDI-3140] Fix bulk_insert failure on Spark 3.2.0 (#4498) leesf 2022-01-04 09:59:59 +08:00
  • 2b2ae34cb9 [HUDI-2558] Fixing Clustering w/ sort columns with null values fails (#4404) harshal 2022-01-03 12:19:43 +05:30
  • 0273f2e65d [MINOR] Update README.md (#4492) Raymond Xu 2022-01-02 20:34:37 -08:00
  • 1e2d2c437d [HUDI-3138] Fix broken UT test for TestHiveSyncTool.testDropPartitions (#4493) YueZhang 2022-01-03 11:43:30 +08:00
  • fe9406dd33 [HUDI-3131] fix ctas error in spark3.1.1 (#4476) Yann Byron 2022-01-02 19:06:55 +08:00
  • 1622b52c9c [HUDI-3136] Fix merge/insert/show partitions error on Spark3.2 (#4490) Yann Byron 2022-01-02 18:42:10 +08:00
  • 188d0338c4 [HUDI-3134] Fix insert error after adding columns on Spark 3.2.0 (#4488) leesf 2022-01-02 09:38:14 +08:00
  • bfa169d808 [HUDI-3040] Fix HoodieSparkBootstrapExample error info for usage (#4341) Aimiyoo 2022-01-01 15:38:38 +08:00
  • ef9923fc55 [HUDI-3107]Fix HiveSyncTool drop partitions using JDBC or hivesql or hms (#4453) YueZhang 2021-12-31 15:56:33 +08:00
  • 2444f40a4b [HUDI-3095] abstract partition filter logic to enable code reuse (#4454) Yuwei XIAO 2021-12-31 13:37:52 +08:00
  • e88b5fd450 [HUDI-3120] Cache compactionPlan in buffer (#4463) yuzhaojing 2021-12-31 13:12:32 +08:00
  • a4e622ac61 [HUDI-1951] Add bucket hash index, compatible with the hive bucket (#3173) Shawy Geng 2021-12-31 04:38:26 +08:00
  • 0f0088fe4b [HUDI-3124] Bootstrap when timeline have completed instant (#4467) yuzhaojing 2021-12-30 11:54:34 +08:00
  • 436becf3ea [HUDI-2675] Fix the exception 'Not an Avro data file' when archive and clean (#4016) 董可伦 2021-12-30 11:53:17 +08:00
  • 674c149234 [HUDI-3083] Support component data types for flink bulk_insert (#4470) Ron 2021-12-30 11:15:54 +08:00
  • 5c0e4ce005 Revert "[HUDI-3043] Revert async cleaner leak commit to unblock CI failure (#4343)" (#4465) Sivabalan Narayanan 2021-12-29 21:45:09 -05:00
  • 504747ecf4 [HUDI-3108] Fix Purge Drop MOR Table Cause error (#4455) ForwardXu 2021-12-29 20:23:23 +08:00
  • a29b27c7ca [MINOR] HoodieInstantTimeGenerator improve method used (#4462) xuzifu666 2021-12-29 18:43:16 +08:00
  • 9412281cb1 [HUDI-2983] Remove Log4j2 transitive dependencies (#4281) Udit Mehrotra 2021-12-28 07:15:05 -08:00
  • 3d7a8695cd Fixing dynamoDbLockConfig required prop check (#4422) Sivabalan Narayanan 2021-12-28 05:26:30 -05:00
  • 05942e018c [HUDI-2811] Support Spark 3.2 (#4270) Yann Byron 2021-12-28 16:12:44 +08:00
  • 32505d5adb [HUDI-3106] Fix HiveSyncTool not sync schema (#4452) ForwardXu 2021-12-28 14:11:14 +08:00
  • 1f7afba5e4 [HUDI-3093] fix spark-sql query table that write with TimestampBasedKeyGenerator (#4416) Yann Byron 2021-12-28 13:39:52 +08:00
  • 6409fc733d [HUDI-2374] Fixing AvroDFSSource does not use the overridden schema to deserialize Avro binaries (#4353) harshal 2021-12-28 09:31:21 +05:30
  • 282aa68552 [HUDI-3099] Purge drop partition for spark sql (#4436) ForwardXu 2021-12-28 09:38:26 +08:00
  • c81df99e50 [HUDI-3102] Do not store rollback plan in inflight instant (#4445) Danny Chan 2021-12-25 18:10:43 +08:00
  • 7b07aac286 [HUDI-3101] Excluding compaction instants from pending rollback info (#4443) Danny Chan 2021-12-25 14:10:45 +08:00
  • 4721073b43 [MINOR] Remove unused method in HoodieActiveTimeline (#4435) xuzifu666 2021-12-24 22:29:34 +08:00
  • 032b883bd1 [HUDI-3014] Add table option to set utc timezone (#4306) xuzifu666 2021-12-23 16:27:45 +08:00
  • 57f43de1ea [MINOR] Fix DedupeSparkJob typo (#4418) Aimiyoo 2021-12-23 03:51:26 +08:00
  • 5d93edc539 [HUDI-3060] drop table for spark sql (#4364) ForwardXu 2021-12-22 19:17:43 +08:00
  • 1a5f8693aa [HUDI-3011] Adding ability to read entire data with HoodieIncrSource with empty checkpoint (#4334) Sivabalan Narayanan 2021-12-22 02:13:06 -08:00
  • b5890cd17d Merge pull request #4308 from harsh1231/HUDI-3008 xiarixiaoyao 2021-12-22 16:46:57 +08:00
  • 15eb7e81fc [HUDI-2547] Schedule Flink compaction in service (#4254) yuzhaojing 2021-12-22 15:08:47 +08:00
  • f1286c2c76 [HUDI-3032] Do not clean the log files right after compaction for metadata table (#4336) Danny Chan 2021-12-22 11:10:27 +08:00
  • 92f54ce3d8 [HUDI-3027] Update hudi-examples README.md (#4330) Aimiyoo 2021-12-22 05:36:03 +08:00
  • 7d046f914a [HUDI-3008] Fixing HoodieFileIndex partition column parsing for nested fields harshal patil 2021-12-14 17:28:18 +05:30
  • 32a44bbe06 [HUDI-2970] Add test for archiving replace commit (#4345) Raymond Xu 2021-12-20 21:01:59 -08:00
  • f3f6112b75 [HUDI-3070] Add rerunFailingTestsCount for flakly testes (#4398) zhangyue19921010 2021-12-21 11:59:50 +08:00
  • 982ae3d1eb [MINOR] Increasing CI timeout to 90 mins (#4407) Sivabalan Narayanan 2021-12-20 17:27:22 -08:00
  • f166ddad12 [MINOR] Remove unused method in HoodieActiveTimeline (#4401) xuzifu666 2021-12-20 22:19:37 +08:00
  • 3ca92108b2 remove unused import (#4349) xuzifu666 2021-12-20 16:32:41 +08:00
  • 4a48f99a59 [HUDI-3064][HUDI-3054] FileSystemBasedLockProviderTestClass tryLock fix and TestHoodieClientMultiWriter test fixes (#4384) Manoj Govindassamy 2021-12-19 10:31:02 -08:00
  • 03f71ef1a2 [HUDI-2970] Adding tests for archival of replace commit actions (#4268) Sivabalan Narayanan 2021-12-18 23:59:39 -08:00
  • 478f9f3695 [minor] fix NetworkUtils#getHostname (#4355) Danny Chan 2021-12-19 10:09:48 +08:00
  • bb99836841 [HUDI-3052] Fix flaky testJsonKafkaSourceResetStrategy (#4381) Raymond Xu 2021-12-18 17:58:51 -08:00
  • f57e28fe39 [MINOR] Azure CI IT tasks clean up (#4337) Raymond Xu 2021-12-18 17:00:56 -08:00
  • 77abb5ccb9 [HUDI-3054] Fixing default lock configs for FileSystemBasedLock and fixing a flaky test (#4374) Sivabalan Narayanan 2021-12-18 13:15:48 -08:00
  • dc40397fa9 [HUDI-3064] Fixing a bug in TransactionManager and FileSystemTestLock (#4372) Sivabalan Narayanan 2021-12-18 08:52:11 -08:00
  • 733732be7b [HUDI-3029] Transaction manager: avoid deadlock when doing begin and end transactions (#4373) Manoj Govindassamy 2021-12-18 08:43:10 -08:00
  • d1d48ed494 [HUDI-3029] Transaction manager: avoid deadlock when doing begin and end transactions (#4363) Manoj Govindassamy 2021-12-18 06:43:17 -08:00
  • 47852446e8 [HUDI-3043] De-coupling multi writer tests (#4362) Sivabalan Narayanan 2021-12-17 18:37:45 -08:00
  • 7784249e55 [HUDI-2962] InProcess lock provider to guard single writer process with async table operations (#4259) Manoj Govindassamy 2021-12-17 17:18:46 -08:00
  • 6eba8345cb [HUDI-3043] Adding some test fixes to continuous mode multi writer tests (#4356) Sivabalan Narayanan 2021-12-17 15:45:05 -05:00
  • 9246b16492 [HUDI-2958] Automatically set spark.sql.parquet.writelegacyformat, when using bulkinsert to insert data which contains decimalType (#4253) xiarixiaoyao 2021-12-17 21:58:02 +08:00
  • e4cfb421c0 [HUDI-3046] Claim RFC number for RFC for Compaction / Clustering Service (#4347) yuzhaojing 2021-12-17 14:52:45 +08:00
  • d0087d4040 [HUDI-3037] Add back remote view storage config for flink (#4338) Danny Chan 2021-12-17 13:57:53 +08:00
  • 7e7ad1558c [HUDI-3043] Revert async cleaner leak commit to unblock CI failure (#4343) Sivabalan Narayanan 2021-12-16 21:51:28 -05:00
  • 294d712948 [HUDI-3001] Clean up the marker directory when finish bootstrap operation. (#4298) xiarixiaoyao 2021-12-17 04:36:01 +08:00
  • a8a192aef5 [Minor] Catch and ignore all the exceptions in quietDeleteMarkerDir (#4301) zhangyue19921010 2021-12-17 03:15:08 +08:00
  • ea2eba1a55 [HUDI-3015] Implement #reset and #sync for metadata filesystem view (#4307) Danny Chan 2021-12-16 15:26:16 +08:00
  • f5b07a77bc [HUDI-2998] claiming rfc number for consistent hashing index (#4303) Yuwei XIAO 2021-12-16 07:47:22 +08:00
  • 27907dea4c [HUDI-3028] Use blob storage to speed up CI downloads (#4331) Raymond Xu 2021-12-15 15:33:33 -08:00
  • 3b89457e2b [HUDI-3025] Add additional wait time for namenode availability during IT tests initiatialization (#4328) Y Ethan Guo 2021-12-15 10:44:42 -08:00
  • 9a2030ab31 [HUDI-3024] Add explicit write handler for flink (#4329) WangMinChao 2021-12-15 20:16:48 +08:00
  • dbec6c512b [HUDI-3022] Fix NPE for isDropPartition method (#4319) ForwardXu 2021-12-15 19:38:02 +08:00
  • bc8bf043d5 [HUDI-2995] Enabling metadata table by default (#4295) Manoj Govindassamy 2021-12-14 13:11:19 -08:00
  • c8d6bd8898 [HUDI-2997] Skip the corrupt meta file for pending rollback action (#4296) Danny Chan 2021-12-14 14:08:13 +08:00
  • 29bc5fd912 [HUDI-2996] Flink streaming reader 'skip_compaction' option does not work (#4304) Fugle666 2021-12-14 11:31:36 +08:00
  • 46de25d5fd [HUDI-2994] Add judgement to existed partitionPath in the catch code block for HU… (#4294) WangMinChao 2021-12-13 20:41:03 +08:00
  • dd96129191 [HUDI-2990] Sync to HMS when deleting partitions (#4291) ForwardXu 2021-12-13 20:40:06 +08:00
  • b22c2c611b [HUDI-2938] Metadata table util to get latest file slices for reader/writers (#4218) Manoj Govindassamy 2021-12-11 20:42:36 -08:00
  • 15444c951f [HUDI-2946] Upgrade maven plugins to be compatible with higher Java versions (#4232) wenningd 2021-12-11 23:18:39 -05:00
  • 8dd0444ef9 [HUDI-2984] Implement #close for AbstractTableFileSystemView (#4285) Danny Chan 2021-12-11 16:19:10 +08:00