1
0

Commit Graph

  • 9625d16937 [HUDI-3849] AvroDeserializer supports AVRO_REBASE_MODE_IN_READ configuration (#5287) cxzl25 2022-05-07 15:39:14 +08:00
  • 52fe1c9fae [HUDI-3675] Adding post write termination strategy to deltastreamer continuous mode (#5073) Sivabalan Narayanan 2022-05-06 09:27:29 -04:00
  • c319ee9cea [HUDI-4017] Improve spark sql coverage in CI (#5512) Raymond Xu 2022-05-06 05:52:06 -07:00
  • 248b0591b0 [HUDI-4042] Support truncate-partition for Spark-3.2 (#5506) Jin Xing 2022-05-06 15:29:47 +08:00
  • abb4893b25 [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor exit gracefully (#4264) guanziyue 2022-05-06 04:49:34 +08:00
  • d794f4fbf9 [MINOR] Optimize code logic (#5499) qianchutao 2022-05-06 00:33:06 +08:00
  • f66e83dc65 [HUDI-3667] Run unit tests of hudi-integ-tests in CI (#5078) Y Ethan Guo 2022-05-04 23:39:18 -07:00
  • 1562bb658f [HUDI-4031] Avoid clustering update handling when no pending replacecommit (#5487) Sagar Sumit 2022-05-04 19:47:11 +05:30
  • 8c9209db28 [HUDI-4005] Update release scripts to help validation (#5479) Raymond Xu 2022-05-04 07:15:54 -07:00
  • 3343cbb47b [MINOR] Update RFC status (#5486) Sagar Sumit 2022-05-03 21:27:18 +05:30
  • 9732ba12da [HUDI-3211][RFC-44] Add RFC for Hudi Connector for Presto (#4563) Todd Gao 2022-05-03 00:35:23 +08:00
  • 6af1ff7a66 [MINOR] Update DOAP for release 0.11.0 (#5467) Raymond Xu 2022-04-30 10:51:16 -07:00
  • 33ff4752ba [HUDI-3978] Fix use of partition path field as hive partition field in flink (#5434) Wangyh 2022-04-30 11:58:54 +08:00
  • f492c52ee4 [HUDI-3862] Fix default configurations of HoodieHBaseIndexConfig (#5308) xicm 2022-04-30 07:21:52 +08:00
  • a1d82b4dc5 [MINOR] Fix CI by ignoring SparkContext error (#5468) Y Ethan Guo 2022-04-29 11:19:07 -07:00
  • e421d536ea [HUDI-3758] Fix duplicate fileId error in MOR table type with flink bucket hash Index (#5185) 吴祥平 2022-04-29 14:10:20 +08:00
  • b27e8b51d8 [MINOR] support different cleaning policy for flink (#5459) Gary Li 2022-04-29 09:48:44 +08:00
  • 4e928a6fe1 [HUDI-3943] Some description fixes for 0.10.1 docs (#5447) LiChuang 2022-04-29 06:18:56 +08:00
  • 52953c8f5e [HUDI-3815] Fix docs description of metadata.compaction.delta_commits default value error (#5368) Ibson 2022-04-28 07:09:44 +08:00
  • cacbd98687 [HUDI-3945] After the async compaction operation is complete, the task should exit. (#5391) watermelon12138 2022-04-27 21:16:09 +08:00
  • 924e2e96a6 Claim RFC 52 for Introduce Secondary Index to Improve HUDI Query Performance (#5441) huberylee 2022-04-27 14:07:29 +08:00
  • e1ccf2e00b [HUDI-3977] Flink hudi table with date type partition path throws HoodieNotSupportedException (#5432) Danny Chan 2022-04-27 13:19:55 +08:00
  • 6ec039ba42 [MINOR] Update alter rename command class type for pattern matching (#5381) KnightChess 2022-04-27 10:39:51 +08:00
  • 77e333298d [HUDI-3478] Claim RFC 51 For CDC (#5437) Yann Byron 2022-04-26 23:26:47 +08:00
  • 762623a15c [HUDI-3972] Fixing hoodie.properties/tableConfig for no preCombine field with writes (#5424) Sivabalan Narayanan 2022-04-25 23:03:10 -04:00
  • f2ba0fead2 [HUDI-3085] Improve bulk insert partitioner abstraction (#4441) Yuwei XIAO 2022-04-25 18:42:17 +08:00
  • 9054b85961 Revert "[HUDI-3951]support generan parameter 'sink.parallelism' for flink-hudi (#5405)" (#5421) ForwardXu 2022-04-25 12:58:27 +08:00
  • d994c58cc0 [HUDI-3946] Validate option path in flink hudi sink (#5397) Ruguo Yu 2022-04-25 10:13:47 +08:00
  • bda3db078e support generan parameter 'sink.parallelism' for flink-hudi (#5405) hehuiyuan 2022-04-24 19:09:39 +08:00
  • 5e5c177e4b [HUDI-3923] Fix cast exception while reading boolean type of partitioned field (#5373) miomiocat 2022-04-23 20:12:54 +08:00
  • 8633bd6e06 [HUDI-3948] Fix presto bundle missing HBase classes (#5398) Y Ethan Guo 2022-04-23 01:33:55 -07:00
  • 505ee672ac [HUDI-3950] add parquet-avro to gcp-bundle (#5399) Raymond Xu 2022-04-22 20:59:49 -07:00
  • 7523542c1d [HUDI-3947] Fixing Hive conf usage in HoodieSparkSqlWriter (#5401) Sivabalan Narayanan 2022-04-22 22:20:05 -04:00
  • 20781a5fa6 [DOCS] Add commit activity, twitter badgers, and Hudi logo in README (#5336) Y Ethan Guo 2022-04-22 01:51:07 -07:00
  • c05a4e7b6f [HUDI-3934] Fix Spark32HoodieParquetFileFormat not being compatible w/ Spark 3.2.0 (#5378) Alexey Kudinkin 2022-04-21 18:00:38 -07:00
  • c4bc2deea0 [HUDI-3936] Fix projection for a nested field as pre-combined key (#5379) Y Ethan Guo 2022-04-21 17:17:57 -07:00
  • 037f89ee7c [HUDI-3921] Fixed schema evolution cannot work with HUDI-3855 (#5376) xiarixiaoyao 2022-04-22 06:27:54 +08:00
  • de5fa1fe03 [HUDI-3940] Fix retry count increment in lock manager (#5387) Sagar Sumit 2022-04-22 02:22:05 +05:30
  • 4e1ac467da [MINOR] Increase azure CI timeout to 120m (#5384) Raymond Xu 2022-04-21 04:35:44 -07:00
  • 4b296f79cc [HUDI-3935] Adding config to fallback to enabled Partition Values extraction from Partition path (#5377) Alexey Kudinkin 2022-04-21 01:36:19 -07:00
  • a9506aa545 [HUDI-3938] Fix default value for num retries to acquire lock (#5380) Sivabalan Narayanan 2022-04-21 04:08:43 -04:00
  • f7544e23ac [HUDI-3204] Fixing partition-values being derived from partition-path instead of source columns (#5364) Alexey Kudinkin 2022-04-20 04:30:27 -07:00
  • 408663c42b [HUDI-3912] Fix lose data when rollback in flink async compact (#5357) 吴祥平 2022-04-20 19:23:39 +08:00
  • 6a3ce928b1 [HUDI-3904] Claim RFC number for Improve timeline server (#5354) Zhaojing Yu 2022-04-20 14:31:21 +08:00
  • 7a9e411e9d [HUDI-3917] Flink write task hangs if last checkpoint has no data input (#5360) Danny Chan 2022-04-20 12:48:24 +08:00
  • 28fdddfee0 [HUDI-3920] Fix partition path construction in metadata table validator (#5365) Y Ethan Guo 2022-04-19 16:40:09 -07:00
  • 6f3fe880d2 [HUDI-3905] Add S3 related setup in Kafka Connect quick start (#5356) Y Ethan Guo 2022-04-19 15:08:28 -07:00
  • 81bf771e56 [HUDI-3902] Fallback to HadoopFsRelation in cases non-involving Schema Evolution (#5352) Alexey Kudinkin 2022-04-19 10:40:20 -07:00
  • 9af7b09aec [HUDI-3894] Fix gcp bundle to include HBase dependencies and shading (#5349) Raymond Xu 2022-04-18 21:47:10 -07:00
  • 4f44e6aeb5 [HUDI-3899] Drop index to delete pending index instants from timeline if applicable (#5342) Sagar Sumit 2022-04-19 07:58:46 +05:30
  • 52d878c52b [HUDI-3903] Fix NoClassDefFoundError with Kafka Connect bundle (#5353) Y Ethan Guo 2022-04-18 18:17:53 -07:00
  • ef6c5611dc [HUDI-3894] Fix datahub to include HBase dependencies and shading (#5338) Y Ethan Guo 2022-04-18 16:20:50 -07:00
  • 7ecb47cd21 [HUDI-3895] Fixing file-partitioning seq for base-file only views to make sure we bucket the files efficiently (#5337) Alexey Kudinkin 2022-04-18 13:06:52 -07:00
  • 1718bcab84 [HUDI-3707] Fix target schema handling in HoodieSparkUtils while creating RDD (#5347) Sagar Sumit 2022-04-18 23:04:04 +05:30
  • b00d03fd62 [HUDI-3886] Adding default null for some of the fields in col stats in MDT schema (#5329) Sivabalan Narayanan 2022-04-18 10:37:03 -04:00
  • 05dfc39c29 Fixing async clustering job test in TestHoodieDeltaStreamer (#5317) Sivabalan Narayanan 2022-04-18 08:08:33 -04:00
  • b8e465fdfc [MINOR] Fix typos in log4j-surefire.properties (#5212) 董可伦 2022-04-16 04:33:37 +08:00
  • 99dd1cb6e6 [HUDI-3835] Add UT for delete in java client (#5270) 董可伦 2022-04-16 03:03:48 +08:00
  • e8ab915aff [MINOR] Removing invalid code to close parquet reader iterator (#5182) Sivabalan Narayanan 2022-04-15 14:50:07 -04:00
  • 57612c5c32 [HUDI-3848] Fixing restore with cleaned up commits (#5288) Sivabalan Narayanan 2022-04-15 14:47:53 -04:00
  • 9e8664f4d2 [HOTFIX] add missing license (#5322) (#5324) Raymond Xu 2022-04-14 12:35:20 -07:00
  • d6a64f765e Revert "[HUDI-3652] Make ObjectSizeCalculator threadlocal to reduce memory footprint (#5060)" (#5323) Raymond Xu 2022-04-14 12:28:27 -07:00
  • f0ab4a6e9e [HUDI-3652] Make ObjectSizeCalculator threadlocal to reduce memory footprint (#5060) sekaiga 2022-04-14 18:08:14 +08:00
  • 6621f3cdbb [HUDI-3845] Fix delete mor table's partition with urlencode's error (#5282) ForwardXu 2022-04-14 16:49:00 +08:00
  • 44b3630b5d [HUDI-3826] Make truncate partition use delete_partition operation (#5272) ForwardXu 2022-04-14 15:53:05 +08:00
  • a081c2b9b5 [HUDI-3876] Fixing fetching partitions in GlueSyncClient (#5318) Sivabalan Narayanan 2022-04-14 00:03:05 -04:00
  • 571cbe4c11 [MINOR] Code cleanup in test utils (#5312) Y Ethan Guo 2022-04-13 14:37:07 -07:00
  • bab691692e [HUDI-3686] Fix inline and async table service check in HoodieWriteConfig (#5307) Y Ethan Guo 2022-04-13 14:33:26 -07:00
  • c7f41f9018 [HUDI-3869] Improve error handling of loading Hudi conf (#5311) Y Ethan Guo 2022-04-13 14:25:31 -07:00
  • 6f9b02decb [HUDI-3870] Add timeout rollback for flink online compaction (#5314) Danny Chan 2022-04-13 20:05:48 +08:00
  • 0281725c6b [MINOR] Inline the partition path logic into the builder (#5310) Danny Chan 2022-04-13 19:24:39 +08:00
  • 43de2b4702 [HUDI-3868] Disable the sort input for flink streaming append mode (#5309) Danny Chan 2022-04-13 14:21:08 +08:00
  • 434e782b7d [HUDI-3867] Disable Data Skipping by default (#5306) Alexey Kudinkin 2022-04-12 22:51:12 -07:00
  • 7b78dff45f [HUDI-3855] Fixing FILENAME_METADATA_FIELD not being correctly updated in HoodieMergeHandle (#5296) Alexey Kudinkin 2022-04-12 17:42:15 -07:00
  • 2e6e302efe [HUDI-3859] Fix spark profiles and utilities-slim dep (#5297) Raymond Xu 2022-04-12 15:33:08 -07:00
  • 2d46d5287e [HUDI-3838] Moved the getPartitionColumns logic to driver. (#5303) Vinoth Govindarajan 2022-04-12 15:03:00 -07:00
  • 25dce94ba2 [MINOR] Integ Test Reducing partitions for log running multi partition yaml (#5300) satishm 2022-04-12 21:45:17 +05:30
  • 84783b9779 [HUDI-3843] Make flink profiles build with scala-2.11 (#5279) Raymond Xu 2022-04-12 08:33:48 -07:00
  • d16740976e [HUDI-3838] Implemented drop partition column feature for delta streamer code path (#5294) Vinoth Govindarajan 2022-04-12 05:40:30 -07:00
  • 101b82a679 [HUDI-3839] Fixing incorrect selection of MT partitions to be updated (#5274) Alexey Kudinkin 2022-04-12 01:07:52 -07:00
  • f91e9e63e1 [HUDI-3799] Fixing not deleting empty instants w/o archiving (#5261) Sivabalan Narayanan 2022-04-11 21:02:43 -07:00
  • 3d8fc78c66 [HUDI-3844] Update props in indexer based on table config (#5293) Sagar Sumit 2022-04-12 03:46:06 +05:30
  • 458fdd5611 [HUDI-3841] Fixing Column Stats in the presence of Schema Evolution (#5275) Alexey Kudinkin 2022-04-11 12:45:53 -07:00
  • 52ea1e4964 [MINOR] fixing timeline server for integ tests (#5289) Sivabalan Narayanan 2022-04-11 07:14:51 -07:00
  • 5c41e30ac5 [HUDI-3817] shade parquet dependency for hudi-hadoop-mr-bundle (#5250) RexXiong 2022-04-11 20:44:46 +08:00
  • 2245a9515f [HUDI-3798] Fixing ending of a transaction by different owner and removing some extraneous methods in trxn manager (#5255) Sivabalan Narayanan 2022-04-10 21:46:07 -07:00
  • 63a099c5b7 [HUDI-3847] Fix NPE due to null schema in HoodieMetadataTableValidator (#5284) Y Ethan Guo 2022-04-10 17:59:29 -07:00
  • 12731f5b89 [HUDI-3842] Integ tests for non partitioned datasets (#5276) Sivabalan Narayanan 2022-04-10 17:09:48 -07:00
  • 976840e8eb [HUDI-3812] Fixing Data Skipping configuration to respect Metadata Table configs (#5244) Alexey Kudinkin 2022-04-10 10:43:47 -07:00
  • 7a9d48d126 [HUDI-3834] Fixing performance hits in reading Column Stats Index (#5266) Alexey Kudinkin 2022-04-10 10:42:06 -07:00
  • 15c264535f [MINOR] Fix typos in the comments of HoodieMergeHandle (#5271) 董可伦 2022-04-10 08:51:58 +08:00
  • 3e97c88c4f [HUDI-3807] Add a new config to control the use of metadata index in HoodieBloomIndex (#5268) Y Ethan Guo 2022-04-09 12:30:11 -07:00
  • 5e65aefc61 [HUDI-3837] Fix license and rat check settings (#5273) Raymond Xu 2022-04-09 11:01:18 -07:00
  • 81b25c543a [HUDI-3825] Fixing Column Stats Index updating sequence (#5267) Alexey Kudinkin 2022-04-08 23:14:08 -07:00
  • 1cc7542357 [MINOR] Update README of docker build setup (#5256) Y Ethan Guo 2022-04-08 16:12:25 -07:00
  • 26eb7b8183 [HUDI-3571] Spark datasource continuous checkpoint should have own fs variable (#5265) satishm 2022-04-08 16:46:01 +05:30
  • d7cc767dbc [HUDI-3825] Fixing non-partitioned table Partition Records persistence in MT (#5259) Alexey Kudinkin 2022-04-08 03:28:31 -07:00
  • 67215abaf0 [HUDI-3827] Promote the inetAddress picking strategy for NetworkUtils#getHostname (#5260) Danny Chan 2022-04-08 14:33:56 +08:00
  • 7a6272fba1 [HUDI-3781] fix spark delete sql can not delete record (#5215) KnightChess 2022-04-08 14:26:40 +08:00
  • df87095ef0 [HUDI-3454] Fix partition name in all code paths for LogRecordScanner (#5252) Sagar Sumit 2022-04-08 09:59:36 +05:30