1
0

Commit Graph

  • b5f05fd153 [HUDI-2906] Add a repair util to clean up dangling data and log files (#4278) Y Ethan Guo 2021-12-11 00:16:05 -08:00
  • 2dcb3f0062 [HUDI-2985] Shade jackson for hudi flink bundle jar (#4284) Danny Chan 2021-12-11 14:40:57 +08:00
  • 9bdcee00c0 [HUDI-2959] Fix the thread leak of cleaning service (#4252) Danny Chan 2021-12-11 12:08:47 +08:00
  • 9797fdfbb2 [HUDI-2974] Make the prefix for metrics name configurable (#4274) rmahindra123 2021-12-10 19:42:20 -08:00
  • c48a2a125a [HUDI-2527] Multi writer test with conflicting async table services (#4046) Manoj Govindassamy 2021-12-10 17:01:19 -08:00
  • 2d864f7524 [HUDI-2814] Make Z-index more generic Column-Stats Index (#4106) Alexey Kudinkin 2021-12-10 14:56:09 -08:00
  • 72901a33a1 [HUDI-2784] Add a hudi-trino-bundle for Trino (#4279) Y Ethan Guo 2021-12-10 14:27:22 -08:00
  • 3ba2909690 [HUDI-2892][BUG] Pending Clustering may stain the ActiveTimeLine and lead to incomplete query results (#4172) zhangyue19921010 2021-12-11 01:57:01 +08:00
  • 3ce0526924 Adding verbose output for metadata validate files command (#4166) Sivabalan Narayanan 2021-12-10 12:38:38 -05:00
  • 3ad9b121f1 [HUDI-2912] Fix CompactionPlanOperator typo (#4187) yuzhaojing 2021-12-11 01:32:53 +08:00
  • be368264f4 [HUDI-2952] Fixing metadata table for non-partitioned dataset (#4243) Sivabalan Narayanan 2021-12-10 11:11:42 -05:00
  • f194566ed4 [HUDI-2849] Improve SparkUI job description for write path (#4222) Yuwei XIAO 2021-12-10 23:22:37 +08:00
  • c7473a7b0c [HUDI-2936] Add data count checks in async clustering tests (#4236) Sagar Sumit 2021-12-10 19:55:37 +05:30
  • 456d74ce4e [HUDI-2901] Fixed the bug clustering jobs cannot running in parallel (#4178) xiarixiaoyao 2021-12-10 14:39:35 +08:00
  • ea154bcb5d Revert "Claiming RFC for data skipping index for updated version (#4271)" (#4272) Sivabalan Narayanan 2021-12-10 00:46:26 -05:00
  • 8321d20c2c Claiming RFC for data skipping index for updated version (#4271) Sivabalan Narayanan 2021-12-09 23:37:42 -05:00
  • 3fb2f974ca [MINOR] FAQ link in SUPPORT_REQUEST template (#4266) arunkc 2021-12-10 04:13:36 +05:30
  • 68f8597b12 [HUDI-2966] Add TaskCompletionListener for HoodieMergeOnReadRDD to close logScaner when the query finished. (#4265) xiarixiaoyao 2021-12-09 19:51:49 +08:00
  • f612a20815 [HUDI-2779] Cache BaseDir if HudiTableNotFound Exception thrown (#4014) RexAn 2021-12-09 18:34:11 +08:00
  • 5ac9ce7289 [MINOR] Fix Compile broken (#4263) leesf 2021-12-09 13:12:18 +08:00
  • 9c8ad0f0fa [HUDI-2665] Fix overflow of huge log file in HoodieLogFormatWriter (#3912) guanziyue 2021-12-09 10:47:13 +08:00
  • bd08470421 [HUDI-2957] Shade kryo jar for flink bundle jar (#4251) Danny Chan 2021-12-09 10:16:42 +08:00
  • 7c3f0777aa [HUDI-2964] Fixing aws lock configs to inherit from HoodieConfig (#4258) Sivabalan Narayanan 2021-12-08 19:17:56 -05:00
  • 082faa3851 [HUDI-2832][RFC-41] Proposal to integrate Hudi on Snowflake platform (#4074) Vinoth Govindarajan 2021-12-08 11:27:19 -08:00
  • c56d93e7b8 [MINOR] Update DOAP with 0.10.0 Release (#4246) Danny Chan 2021-12-08 17:55:22 +08:00
  • c9e18d1e7d [HUDI-2942] add error message log in HoodieCombineHiveInputFormat (#4224) xuzifu666 2021-12-08 14:05:39 +08:00
  • e8473b9a2b [HUDI-2951] Disable remote view storage config for flink (#4237) Danny Chan 2021-12-07 18:04:15 +08:00
  • 6dab307e6f [MINOR] Remove redundant and conflicting spark-hive dependency (#4228) Sagar Sumit 2021-12-07 07:18:32 +05:30
  • 4a437f25d3 [MINOR] Use maven-shade-plugin version for hudi-timeline-server-bundle from main pom.xml (#4209) wenningd 2021-12-06 15:29:18 -05:00
  • 2d66451a51 [MINOR] Fix partition path formatting in error log (#4168) Y Ethan Guo 2021-12-06 11:11:44 -08:00
  • 57c4bf8152 [HUDI-2876] for hive/presto hudi should remove the temp file which created by HoodieMergedLogRecordSanner when the query finished. (#4139) xiarixiaoyao 2021-12-06 21:33:10 +08:00
  • 84b531ae75 [HUDI-2900] Fix corrupt block end position (#4181) Ron 2021-12-06 20:38:39 +08:00
  • f0e46bf522 [HUDI-2916] Add IssueNavigationLink for IDEA (#4192) leesf 2021-12-06 14:53:54 +08:00
  • 734c9f5f2d [HUDI-2418] Support HiveSchemaProvider (#3671) 冯健 2021-12-05 16:10:13 +08:00
  • 63b15607ff [HUDI-2937] Introduce a pulsar implementation of hoodie write commit … (#4217) ForwardXu 2021-12-05 15:51:06 +08:00
  • a8fb69656f [HUDI-2877] Support flink catalog to help user use flink table conveniently (#4153) Ron 2021-12-05 10:14:29 +08:00
  • 36b69d8033 [HUDI-2935] Remove special casing of clustering in deltastreamer checkpoint retrival (#4216) vinoth chandar 2021-12-04 01:16:11 -08:00
  • 568181a3e7 [HUDI-2934] Optimize RequestHandler code style fengli 2021-12-04 13:56:25 +08:00
  • 1d4fb827e7 [HUDI-2923] Fixing metadata table reader when metadata compaction is inflight (#4206) Sivabalan Narayanan 2021-12-04 00:44:50 -05:00
  • 94f45e928c [HUDI-2890] Kafka Connect: Fix failed writes and avoid table service concurrent operations (#4211) rmahindra123 2021-12-03 21:30:32 -08:00
  • 0fd6b2d71e [HUDI-2933] DISABLE Metadata table by default (#4213) vinoth chandar 2021-12-03 21:12:35 -08:00
  • a799fae316 [MINOR] Mitigate CI jobs timeout issues (#4173) Raymond Xu 2021-12-03 21:08:32 -08:00
  • 5616830ae1 Revert "[HUDI-2489]Tuning HoodieROTablePathFilter by caching hoodieTableFileSystemView, aiming to reduce unnecessary list/get requests" zhangyue19921010 2021-12-04 10:56:53 +08:00
  • 383d5edc16 [HUDI-2894][HUDI-2905] Metadata table - avoiding key lookup failures on base files over S3 (#4185) Manoj Govindassamy 2021-12-03 11:18:10 -08:00
  • 2f96f4300b Revert "[HUDI-2495] Resolve inconsistent key generation for timestamp types by GenericRecord and Row (#3944)" (#4201) Yann Byron 2021-12-04 00:13:38 +08:00
  • bed7f9897a [HUDI-2911] Removing default value for PARTITIONPATH_FIELD_NAME resulting in incorrect KeyGenerator configuration (#4195) Alexey Kudinkin 2021-12-03 04:33:38 -08:00
  • e483f7c776 [HUDI-2902] Fixing populate meta fields with Hfile writers and Disabling virtual keys by default for metadata table (#4194) Sivabalan Narayanan 2021-12-03 07:20:21 -05:00
  • ca427240c0 [MINOR] use catalog schema if can not find table schema (#4182) Yann Byron 2021-12-03 16:37:13 +08:00
  • 0699521f83 [HUDI-2924] Refresh the fs view on successful checkpoints for write profile (#4199) Danny Chan 2021-12-03 16:12:59 +08:00
  • f74b3d12aa [minor] Refactor write profile to always generate fs view (#4198) Danny Chan 2021-12-03 11:38:29 +08:00
  • 934fe54cc5 [HUDI-2914] Fix remote timeline server config for flink (#4191) Danny Chan 2021-12-03 08:59:10 +08:00
  • 91d2e61433 [HUDI-2904] Fix metadata table archival overstepping between regular writers and table services (#4186) rmahindra123 2021-12-02 10:32:26 -08:00
  • 61a03bc072 [MINOR] Fix the wrong usage of timestamp length variable bug (#4179) zzzhy 2021-12-02 22:47:31 +08:00
  • 772f5ca24e Fixed partitions produced by layout optimization in case order-by key is composed of a single column (#4183) Alexey Kudinkin 2021-12-01 20:56:04 -08:00
  • 5284730175 [HUDI-2881] Compact the file group with larger log files to reduce write amplification (#4152) Shawy Geng 2021-12-02 09:41:04 +08:00
  • f4c25ba3fd [HUDI-2880] Fixing loading of props from default dir (#4167) Sivabalan Narayanan 2021-12-01 03:02:30 -05:00
  • 9b254b6fc5 Revert "[HUDI-2856] Bit cask disk map delete modified (#4116)" (#4171) Y Ethan Guo 2021-11-30 22:08:44 -08:00
  • 24380c2060 Revert "[HUDI-2855] Change the default value of 'PAYLOAD_CLASS_NAME' to 'DefaultHoodieRecordPayload' (#4115)" (#4169) Alexey Kudinkin 2021-11-30 17:47:16 -08:00
  • ea009b55a3 [HUDI-2891] Fix write configs for Java engine in Kafka Connect Sink (#4161) Y Ethan Guo 2021-11-30 06:45:50 -08:00
  • a398aad1fc [HUDI-2642] Add support ignoring case in update sql operation (#3882) 董可伦 2021-11-30 14:36:36 +08:00
  • 3433f00cb5 [MINOR] Fix typo,rename 'getUrlEncodePartitoning' to 'getUrlEncodePartitioning' (#4130) 董可伦 2021-11-30 10:31:22 +08:00
  • 536af4b954 [MINOR] Fix syntax error in create_source_release.sh (#4150) Danny Chan 2021-11-29 14:17:24 +08:00
  • 38e75ea806 Removing rfc from release package and fixing release validation script (#4147) Sivabalan Narayanan 2021-11-29 00:18:35 -05:00
  • 52aae36b53 [MINOR] Fixing integ test suite for hudi-aws and archival validation (#4142) Sivabalan Narayanan 2021-11-28 20:11:50 -05:00
  • eca1693288 [MINOR] fix typo (#4140) vortual 2021-11-28 17:13:50 +08:00
  • a1d0ff4209 Moving to 0.11.0-SNAPSHOT on master branch. yuzhao.cyz 2021-11-27 17:22:10 +08:00
  • 780a2ac5b2 [HUDI-2102] Support hilbert curve for hudi (#3952) xiarixiaoyao 2021-11-27 15:20:19 +08:00
  • 2c7656c35f [HUDI-2475] [HUDI-2862] Metadata table creation and avoid bootstrapping race for write client & add locking for upgrade (#4114) Manoj Govindassamy 2021-11-26 23:19:26 -08:00
  • 3a8d64e584 [HUDI-2868] Fix skipped HoodieSparkSqlWriterSuite (#4125) Raymond Xu 2021-11-26 19:59:20 -08:00
  • 9c059ef8e5 [MINOR] Follow ups from HUDI-2861 (re-use same rollback instant for failed rollback) (#4133) Sivabalan Narayanan 2021-11-26 19:22:53 -05:00
  • 257a6a7456 [HUDI-2856] Bit cask disk map delete modified (#4116) xuzifu666 2021-11-27 07:11:01 +08:00
  • 9028e6e1e4 [HUDI-2864] Fix README and scripts with current limitations of hive sync (#4129) rmahindra123 2021-11-26 15:09:32 -08:00
  • 8402cac407 [HUDI-2848] Excluse guava from hudi-cli pom (#4100) huleilei 2021-11-27 05:56:03 +08:00
  • 445208a0d2 [HUDI-2845] Metadata CLI - files/partition file listing fix and new validate option (#4092) Manoj Govindassamy 2021-11-26 13:44:16 -08:00
  • d1e83e4ba0 [HUDI-2767] Enabling timeline-server-based marker as default (#4112) Y Ethan Guo 2021-11-26 13:41:05 -08:00
  • f8e0176eb0 [HUDI-2861] Re-use same rollback instant time for failed rollbacks (#4123) Sivabalan Narayanan 2021-11-26 16:36:42 -05:00
  • a88691fed3 [MINOR] Fixing test failure to fix CI build failure (#4132) Sivabalan Narayanan 2021-11-26 13:50:10 -05:00
  • 5755ff25a4 [HUDI-2814] Addressing issues w/ Z-order Layout Optimization (#4060) Alexey Kudinkin 2021-11-26 10:02:15 -08:00
  • 3d75aca40d [HUDI-2850] Fixing Clustering CLI - schedule and run command fixes to avoid NumberFormatException (#4101) Manoj Govindassamy 2021-11-26 04:17:23 -08:00
  • e9efbdb63c [HUDI-2863] Rename option 'hoodie.parquet.page.size' to 'write.parquet.page.size' (#4128) Danny Chan 2021-11-26 16:40:53 +08:00
  • e554c7f468 [HUDI-2852] Table metadata returns empty for non-exist partition (#4117) mincwang 2021-11-26 16:24:03 +08:00
  • f5da9b50fa [MINOR] Include hudi-aws in flink bundle jar (#4127) Danny Chan 2021-11-26 14:36:44 +08:00
  • 38585e4e57 [HUDI-2851] Shade org.apache.hadoop.hive.ql.optimizer package for flink bundle jar (#4104) Ron 2021-11-26 11:27:21 +08:00
  • 8340ccb503 [HUDI-2005] Removing direct fs call in HoodieLogFileReader (#3865) Sivabalan Narayanan 2021-11-25 18:51:38 -05:00
  • 6f5d8d04cd [HUDI-2840] Fixed DeltaStreaemer to properly respect configuration passed t/h properties file (#4090) Alexey Kudinkin 2021-11-25 14:48:22 -08:00
  • e0125a7911 [HUDI-2801] Add Amazon CloudWatch metrics reporter (#4081) Udit Mehrotra 2021-11-25 13:33:16 -08:00
  • 8e1379384a [HUDI-2841] Fixing lazy rollback for MOR with list based strategy (#4110) Sivabalan Narayanan 2021-11-25 16:06:04 -05:00
  • 6a0f079866 [HUDI-2858] Fixing handling of cluster update reject exception in deltastreamer (#4120) Sivabalan Narayanan 2021-11-25 14:34:07 -05:00
  • f692078d32 [HUDI-2671] Making error -> warn logs from timeline server with concurrent writers for inconsistent state (#4088) Sivabalan Narayanan 2021-11-25 14:21:32 -05:00
  • 7bb90e8caf [HUDI-2794] Guarding table service commits within a single lock to commit to both data table and metadata table (#4037) Sivabalan Narayanan 2021-11-25 14:19:30 -05:00
  • b972aa5bf2 [HUDI-2800] Remove rdd.isEmpty() validation to prevent CreateHandle being called twice (#4121) Sagar Sumit 2021-11-25 23:46:36 +05:30
  • 264e1ce63c [HUDI-1290] fixing mysql debezium source (#4119) satishm 2021-11-25 21:56:59 +05:30
  • a2eb2b0b0a [HUDI-2480] FileSlice after pending compaction-requested instant-time… (#3703) Danny Chan 2021-11-25 22:30:09 +08:00
  • 88067f57a2 [HUDI-2855] Change the default value of 'PAYLOAD_CLASS_NAME' to 'DefaultHoodieRecordPayload' (#4115) 董可伦 2021-11-25 19:17:38 +08:00
  • a9bd20804b [HUDI-2792] Configure metadata payload consistency check (#4035) Sivabalan Narayanan 2021-11-24 21:56:31 -05:00
  • 83f8ed2ae3 [HUDI-1290] Add Debezium Source for deltastreamer (#4063) rmahindra123 2021-11-24 17:57:02 -08:00
  • abc0175cf7 [HUDI-1290] [RFC-39] Deltastreamer avro source for Debezium CDC (#4048) rmahindra123 2021-11-24 17:31:34 -08:00
  • bef373fa1d [MINOR] Fix build failure due to checkstyle issues (#4111) Y Ethan Guo 2021-11-24 17:17:46 -08:00
  • 51297736ca [HUDI-2844][CLI] Fixing archived Timeline crashing if timeline contains REPLACE_COMMIT (#4091) Alexey Kudinkin 2021-11-24 16:53:29 -08:00
  • 7286b56d30 [HUDI-2853] Add JMX deps in hudi utilities and kafka connect bundles (#4108) rmahindra123 2021-11-24 16:03:01 -08:00