1
0

Commit Graph

  • 7da97c8096 [HUDI-4171] Fixing Non partitioned with virtual keys in read path (#5747) Sivabalan Narayanan 2022-06-06 15:48:21 -04:00
  • 21b903fddb [HUDI-4197] Fix Async indexer to support building FILES partition (#5766) Sivabalan Narayanan 2022-06-06 15:47:11 -04:00
  • 4f6fc726d0 [HUDI-4140] Fixing hive style partitioning and default partition with bulk insert row writer with SimpleKeyGen and virtual keys (#5664) Sivabalan Narayanan 2022-06-06 13:21:00 -04:00
  • 4f7ea8c79a [HUDI-4176] Fixing TableSchemaResolver to avoid repeated HoodieCommitMetadata parsing (#5733) Alexey Kudinkin 2022-06-06 10:14:26 -07:00
  • 132c0aa8c7 [HUDI-4101] When BucketIndexPartitioner take partition path for dispersion may cause the fileID of the task to not be loaded correctly (#5763) HunterXHunter 2022-06-06 21:53:55 +08:00
  • 21ab0ff8be [HUDI-4195] Bulk insert should use right keygen for non-partitioned table (#5759) Sagar Sumit 2022-06-06 16:49:03 +05:30
  • 22c45a7704 [HUDI-4188] Fix flaky ITTestDataSTreamWrite.testWriteCopyOnWrite (#5749) Danny Chan 2022-06-06 12:12:48 +08:00
  • 73b0be3c96 [HUDI-4192] HoodieHFileReader scan top cells after bottom cells throw NullPointerException (#5755) marchpure 2022-06-06 12:07:26 +08:00
  • 5d18b80343 [HUDI-4190] Include hbase-protocol for shading in the bundles (#5750) Y Ethan Guo 2022-06-05 17:42:16 -07:00
  • bd26d633d7 [HUDI-4168] Add Call Procedure for marker deletion (#5738) Saisai Shao 2022-06-05 11:05:38 +08:00
  • 80783c27f5 [HUDI-4187] Fix partition order in aws glue sync (#5731) Nicolas Paris 2022-06-04 11:16:52 +02:00
  • 3759a38b99 [HUDI-4183] Fix using HoodieCatalog to create non-hudi tables (#5743) leesf 2022-06-03 17:16:48 +08:00
  • 51602a34f7 [HUDI-4179] Cluster with sort cloumns invalid (#5739) KnightChess 2022-06-02 20:28:21 +08:00
  • 7f8630cc57 [HUDI-4167] Remove the timeline refresh with initializing hoodie table (#5716) Danny Chan 2022-06-02 09:48:48 +08:00
  • 7276d0eaa6 [HUDI-3670] free temp views in sql transformers (#5080) Qi Ji 2022-06-01 22:35:40 +08:00
  • dfcd6d9a86 [HUDI-4011] Add hudi-aws-bundle (#5674) Sagar Sumit 2022-06-01 18:00:29 +05:30
  • 0d069b5e57 [HUDI-4174] Add hive conf dir option for flink sink (#5725) Danny Chan 2022-06-01 16:17:36 +08:00
  • 795a99ba73 [HUDI-4107] Added --sync-tool-classes config option in HoodieMultiTableDeltaStreamer (#5597) Kumud Kumar Srivatsava Tirupati 2022-05-31 20:27:50 +05:30
  • 918c4f4e0b [HUDI-4149] Drop-Table fails when underlying table directory is broken (#5672) Jin Xing 2022-05-30 19:09:26 +08:00
  • 329da34ee0 [HUDI-4163] Catch general exception instead of IOException while fetching rollback plan during rollback (#5703) Danny Chan 2022-05-30 13:08:02 +08:00
  • 7e86884604 [HUDI-4086] Use CustomizedThreadFactory in async compaction and clustering (#5563) 苏承祥 2022-05-29 13:35:47 +08:00
  • 0a72458291 [HUDI-3551] Fix testStorageSchemes for oci storage (#5711) Raymond Xu 2022-05-28 12:13:37 -07:00
  • 62d792368b [HUDI-3551] Add the Oracle Cloud Infrastructure (oci) Object Storage URI scheme (#4952) Carter Shanklin 2022-05-28 08:26:14 -07:00
  • 48062a5708 [HUDI-4166] Added SimpleClient plugin for integ test (#5710) uday08bce 2022-05-28 17:20:52 +02:00
  • 8fa8f26031 [MINOR] Fix Hive and meta sync config for sql statement (#5316) ForwardXu 2022-05-28 22:56:39 +08:00
  • 58014c147a [HUDI-4160] Make database regex of MaxwellJsonKafkaSourcePostProcessor optional (#5697) wangxianghu 2022-05-28 11:13:24 +04:00
  • 93fe5a497e [HUDI-4151] flink split_reader supports rocksdb (#5675) Bo Cui 2022-05-28 08:37:34 +08:00
  • 554caa3421 [MINOR] Fix the issue when handling conf hoodie.datasource.write.operation=bulk_insert in sql mode (#5679) RexAn 2022-05-27 19:45:09 +08:00
  • 1767ff5e7c [HUDI-4161] Make sure partition values are taken from partition path (#5699) Alexey Kudinkin 2022-05-27 02:36:30 -07:00
  • 57dbe57bed [HUDI-4162] Fixed some constant mapping issues. (#5700) watermelon12138 2022-05-27 14:08:54 +08:00
  • 85962ee55d [HUDI-3963][RFC-53] Use Lock-Free Message Queue Disruptor Improving Hoodie Writing Efficiency (#5567) YueZhang 2022-05-26 23:03:09 +08:00
  • 8d2f009048 [HUDI-4124] Add valid check in Spark Datasource configs (#5637) komao 2022-05-26 20:21:28 +08:00
  • 31e13db1f0 [HUDI-4023] Decouple hudi-spark from hudi-utilities-slim-bundle (#5641) Sagar Sumit 2022-05-26 11:28:49 +05:30
  • 98c5c6c654 [HUDI-4040] Bulk insert Support CustomColumnsSortPartitioner with Row (#5502) RexAn 2022-05-26 13:09:04 +08:00
  • 4e42ed5eae [HUDI-4145] Archives the metadata file in HoodieInstant.State sequence (part2) (#5676) Danny Chan 2022-05-26 11:21:39 +08:00
  • cf837b4900 [HUDI-3193] Decouple hudi-aws from hudi-client-common (#5666) Sagar Sumit 2022-05-25 19:38:56 +05:30
  • a6bc9e8e81 [HUDI-4146] Claim RFC-55 for Improve Hive/Meta sync class design and hierachies (#5682) 冯健 2022-05-25 20:31:39 +08:00
  • f30b3aef3e [MINOR] Fix a potential NPE and some finer points of hudi cli (#5656) luoyajun 2022-05-25 02:13:18 +08:00
  • 18635b533e Merge pull request #3599 from yuzhaojing/HUDI-2207 Zhaojing Yu 2022-05-25 00:47:28 +08:00
  • 10363c1412 [HUDI-4132] Fixing determining target table schema for delta sync with empty batch (#5648) Sivabalan Narayanan 2022-05-24 08:17:15 -04:00
  • c20db99a7b [HUDI-2207] Support independent flink hudi clustering function 喻兆靖 2022-05-21 21:25:15 +08:00
  • 0caa55ecb4 [HUDI-4135] remove netty and netty-all (#5663) liujinhui 2022-05-24 18:56:28 +08:00
  • eb219010d2 [HUDI-4145] Archives the metadata file in HoodieInstant.State sequence (#5669) Danny Chan 2022-05-24 17:33:30 +08:00
  • c05ebf2417 [HUDI-2473] Fixing compaction write operation in commit metadata (#5203) Sivabalan Narayanan 2022-05-24 03:33:21 -04:00
  • 676d5cefe0 [HUDI-4138] Fix the concurrency modification of hoodie table config for flink (#5660) Danny Chan 2022-05-24 13:07:55 +08:00
  • af1128acf9 [HUDI-4084] Add support to test async table services with integ test suite framework (#5557) Sivabalan Narayanan 2022-05-23 23:05:56 -04:00
  • 47b764ec33 [HUDI-4134] Fix Method naming consistency issues in FSUtils (#5655) Heap 2022-05-24 06:28:48 +08:00
  • 716e995a38 [MINOR] Removing redundant semicolons and line breaks (#5662) felixYyu 2022-05-24 06:26:36 +08:00
  • 752f956f03 [HUDI-3933] Add UT cases to cover different key gen (#5638) Y Ethan Guo 2022-05-23 06:48:09 -07:00
  • 42c7129e25 [HUDI-4142] Claim RFC-54 for new table APIs (#5665) Sagar Sumit 2022-05-23 18:10:07 +05:30
  • 3ef137d156 [HUDI-4129] Initializes a new fs view for WriteProfile#reload (#5640) YuangZhang 2022-05-23 09:57:34 +08:00
  • 271d1a79c0 [HUDI-4051] Allow nested field as primary key and preCombineField in spark sql (#5517) Raymond Xu 2022-05-22 00:47:51 -07:00
  • 32a5d268f5 [HUDI-3890] fix rat plugin issue with sql files (#5644) uday08bce 2022-05-21 18:22:55 +02:00
  • 922f765ead [HUDI-4100] CTAS failed to clean up when given an illegal MANAGED table definition (#5588) Jin Xing 2022-05-21 22:41:18 +08:00
  • 8ec625d4d5 [HUDI-3858] Shade javax.servlet for Spark bundle jar (#5295) YueZhang 2022-05-21 21:16:14 +08:00
  • b5adba3e55 [MINOR] remove unused gson test dependency (#5652) Raymond Xu 2022-05-21 05:34:08 -07:00
  • 2af98303d3 [HUDI-4122] Fix NPE caused by adding kafka nodes (#5632) wangxianghu 2022-05-21 07:12:53 +04:00
  • 7d02b1fd3c [MINOR] Minor fixes to exception log and removing unwanted metrics flush in integ test (#5646) Sivabalan Narayanan 2022-05-20 19:27:35 -04:00
  • 85b146d3d5 [HUDI-3985] Refactor DLASyncTool to support read hoodie table as spark datasource table (#5532) huberylee 2022-05-20 22:25:32 +08:00
  • c7576f7613 [HUDI-4130] Remove the upgrade/downgrade for flink #initTable (#5642) Danny Chan 2022-05-20 21:31:23 +08:00
  • 1da0b21edd [HUDI-4119] the first read result is incorrect when Flink upsert- Kafka connector is used in HUDi (#5626) aliceyyan 2022-05-20 18:10:24 +08:00
  • 6f37863ba8 [HUDI-4114] Remove the unnecessary fs view sync for BaseWriteClient#initTable (#5617) Danny Chan 2022-05-19 10:59:05 +08:00
  • 6573469e73 [HUDI-4116] Unify clustering/compaction related procedures' output type (#5620) huberylee 2022-05-19 09:48:03 +08:00
  • 551aa959c5 Revert "[HUDI-3870] Add timeout rollback for flink online compaction (#5314)" (#5622) Danny Chan 2022-05-18 20:30:54 +08:00
  • 199f64255e [HUDI-4111] Bump ANTLR runtime version in Spark 3.x (#5606) cxzl25 2022-05-18 19:18:52 +08:00
  • 008616c4f6 [HUDI-3942] [RFC-50] Improve Timeline Server (#5392) Zhaojing Yu 2022-05-18 18:43:48 +08:00
  • a1017c66aa Clean the marker files for flink compaction (#5611) luokey 2022-05-18 11:21:14 +08:00
  • f1f8a1abb7 [HUDI-4109] Copy the old record directly when it is chosen for merging (#5603) Danny Chan 2022-05-18 10:17:00 +08:00
  • ebbe56e862 [minor] Some code refactoring for LogFileComparator and Instant instantiation (#5600) Danny Chan 2022-05-18 09:30:09 +08:00
  • f8b9399615 [MINOR] Fixing spark long running yaml for non-partitioned (#5607) Sivabalan Narayanan 2022-05-17 09:58:18 -04:00
  • 99555c897a [HUDI-4110] Clean the marker files for flink compaction (#5604) BruceLin 2022-05-17 21:09:27 +08:00
  • d422f69a0d [HUDI-4087] Support dropping RO and RT table in DropHoodieTableCommand (#5564) Jin Xing 2022-05-17 14:12:50 +08:00
  • d52d13302d [HUDI-4101] BucketIndexPartitioner should take partition path for better dispersion (#5590) Danny Chan 2022-05-17 10:34:57 +08:00
  • fdd96cc97e [HUDI-4104] DeltaWriteProfile includes the pending compaction file slice when deciding small buckets (#5594) Danny Chan 2022-05-17 10:34:15 +08:00
  • ad773b3d96 [HUDI-3654] Preparations for hudi metastore. (#5572) Shawy Geng 2022-05-17 09:47:10 +08:00
  • a7a42e4490 [HUDI-4103] [HUDI-4001] Filter the properties should not be used when create table for Spark SQL 董可伦 2022-05-16 23:26:23 +08:00
  • 43e08193ef [HUDI-4098] Metadata table heartbeat for instant has expired, last heartbeat 0 (#5583) Danny Chan 2022-05-16 17:40:08 +08:00
  • 61030d8e7a [HUDI-3123] consistent hashing index: basic write path (upsert/insert) (#4480) Yuwei XIAO 2022-05-16 11:07:01 +08:00
  • 1fded18dff fix hive sync no partition table error (#5585) 陈浩 2022-05-16 09:51:24 +08:00
  • 75f847691f [HUDI-4001] Filter the properties should not be used when create table for Spark SQL (#5495) 董可伦 2022-05-16 09:50:29 +08:00
  • 6e16e719cd [HUDI-3980] Suport kerberos hbase index (#5464) xi chaomin 2022-05-14 19:37:31 +08:00
  • 52e63b39d6 [HUDI-4097] add table info to jobStatus (#5529) wqwl611 2022-05-14 09:01:15 +08:00
  • 5c4813f101 [HUDI-4072] Fix NULL schema for empty batches in deltastreamer (#5543) Sivabalan Narayanan 2022-05-13 08:26:47 -04:00
  • a704e3740c [HUDI-3336][HUDI-FLINK]Support custom hadoop config for flink (#5574) Bo Cui 2022-05-13 19:52:55 +08:00
  • 7fb436d3cf [HUDI-4078][HUDI-FLINK]BootstrapOperator contains the pending compact… (#5545) Bo Cui 2022-05-13 14:32:48 +08:00
  • 8ad0bb9745 [MINOR] Fix a NPE for Option (#5461) Xingcan Cui 2022-05-13 00:20:40 -04:00
  • 701f8c039d [HUDI-3336][HUDI-FLINK]Support custom hadoop config for flink (#5528) Bo Cui 2022-05-13 09:50:11 +08:00
  • 0cec955fa2 [HUDI-4018][HUDI-4027] Adding integ test yamls for immutable use-cases. Added delete partition support to integ tests (#5501) Sivabalan Narayanan 2022-05-12 21:01:55 -04:00
  • ecd47e7aae [HUDI-3963][Claim RFC number 53] Use Lock-Free Message Queue Improving Hoodie Writing Efficiency. (#5562) YueZhang 2022-05-12 19:26:00 +08:00
  • b10ca7e69f [HUDI-4085] Fixing flakiness with parquet empty batch tests in TestHoodieDeltaStreamer (#5559) Sivabalan Narayanan 2022-05-11 16:02:54 -04:00
  • 7f0c1f3ddf [HUDI-4079] Supports showing table comment for hudi with spark3 (#5546) Jin Xing 2022-05-11 22:28:58 +08:00
  • 4a8589f222 [HUDI-4038] Avoid calling getDataSize after every record written (#5497) Alexey Kudinkin 2022-05-11 05:08:31 -07:00
  • 4258a71517 [HUDI-4003] Try to read all the log file to parse schema (#5473) Lanyuanxiaoyao 2022-05-11 06:45:53 +08:00
  • 6fd21d0f10 [HUDI-4044] When reading data from flink-hudi to external storage, the … (#5516) aliceyyan 2022-05-10 10:25:13 +08:00
  • 6285a239a3 [HUDI-3995] Making perf optimizations for bulk insert row writer path (#5462) Sivabalan Narayanan 2022-05-09 12:40:22 -04:00
  • 6b47ef6ed2 [HUDI-4053] Flaky ITTestHoodieDataSource.testStreamWriteBatchReadOpti… (#5526) xicm 2022-05-09 16:35:50 +08:00
  • 4c70840275 [MINOR] Fixing close for HoodieCatalog's test (#5531) ForwardXu 2022-05-09 15:17:24 +08:00
  • 75eaa0bffe [HUDI-4055]refactor ratelimiter to avoid stack overflow (#5530) guanziyue 2022-05-09 10:27:37 +08:00
  • 569a76a9a5 [MINOR] fixing flaky tests in deltastreamer tests (#5521) Sivabalan Narayanan 2022-05-07 15:37:20 -04:00
  • 80f99893a0 [MINOR] Fixing class not found when using flink and enable metadata table (#5527) BruceLin 2022-05-07 20:03:18 +08:00