Commit Graph

  • c744848c59 [HUDI-4366] Synchronous cleaning for flink bounded source (#6051) Danny Chan 2022-07-08 09:55:07 +08:00
  • 5673819736 [HUDI-4309] fix spark32 repartition error (#6033) KnightChess 2022-07-08 09:38:09 +08:00
  • e74ad324c3 [HUDI-4152] Flink offline compaction support compacting multi compaction plan at once (#5677) Lanyuanxiaoyao 2022-07-07 14:11:26 +08:00
  • 7eeaff9ee0 [HUDI-4357] Support flink 1.15.x (#6050) Danny Chan 2022-07-06 13:42:58 +08:00
  • b18c32379f [HUDI-4219] Merge Into when update expression "col=s.col+2" on precombine cause exception (#5828) shenjiayu17 2022-07-06 09:10:35 +08:00
  • 3670e82af5 [HUDI-4356] Fix the error when sync hive in CTAS (#6029) 董可伦 2022-07-06 00:08:23 +08:00
  • 8570c3aab4 [HUDI-4359] Support show_fs_path_detail command on Call Produce Command (#6042) ForwardXu 2022-07-05 23:56:32 +08:00
  • 23c9c5c296 [HUDI-3836] Improve the way of fetching metadata partitions from table (#5286) xi chaomin 2022-07-05 22:50:17 +08:00
  • fbda4ad5bd [HUDI-4360] Fix HoodieDropPartitionsTool based on refactored meta sync (#6043) Y Ethan Guo 2022-07-04 23:37:21 -07:00
  • 45fdcf68a1 [HUDI-3116]Add a new HoodieDropPartitionsTool to let users drop table partitions through a standalone job. (#4459) YueZhang 2022-07-05 10:24:18 +08:00
  • 6187622178 [MINOR] Improve variable names (#6039) Shiyan Xu 2022-07-04 20:03:50 -05:00
  • c091e4cc30 [HUDI-3730] Add ConfigTool#toMap UT (#6035) voonhous 2022-07-05 06:07:19 +08:00
  • e0954040a9 [HUDI-3511] Add call procedure for MetadataCommand (#6018) superche 2022-07-03 21:44:56 +08:00
  • c0e1587966 [HUDI-3730] Improve meta sync class design and hierarchies (#5854) Shiyan Xu 2022-07-03 04:17:25 -05:00
  • c00ea84985 [HUDI-3505] Add call procedure for UpgradeOrDowngradeCommand (#6012) superche 2022-07-03 08:47:48 +08:00
  • 47792a3186 [HUDI-4353] Column stats data skipping for flink (#6026) Danny Chan 2022-07-03 08:29:31 +08:00
  • bdf73b2650 [HUDI-3953]Flink Hudi module should support low-level source and sink api (#5445) JerryYue-M 2022-07-02 08:38:46 +08:00
  • 62a0c962ac [HUDI-3634] Could read empty or partial HoodieCommitMetaData in downstream if using HDFS (#5048) RexAn 2022-07-01 02:07:40 +08:00
  • 397fd30142 [HUDI-3984] Remove mandatory check of partiton path for cli command (#5458) miomiocat 2022-07-01 01:00:13 +08:00
  • 8547899a39 [HUDI-4285] add ByteBuffer#rewind after ByteBuffer#get in AvroDeseria… (#5907) komao 2022-06-30 20:48:50 +08:00
  • cdaaa3c4c7 [HUDI-4346] Fix params not update BULKINSERT_ARE_PARTITIONER_RECORDS_SORTED (#5999) RexAn 2022-06-30 10:26:00 +08:00
  • 6a01f7029c [MINOR] Following #2070, Fix BindException when running tests on shared machines. (#5951) cxzl25 2022-06-30 10:20:59 +08:00
  • 3948b8935a [HUDI-4336] Fix records overwritten bug with binary primary key (#5996) luoyajun 2022-06-30 09:12:00 +08:00
  • 03a94d9ff5 [HUDI-4331] Allow loading external config file from class loader (#5987) wenningd 2022-06-29 17:04:34 -07:00
  • e71f04768e [MINOR] Make CLI 'commit rollback' using rollbackUsingMarkers false as default (#5174) YueZhang 2022-06-30 01:12:46 +08:00
  • 637660b7aa [HUDI-1575] Claim RFC-56: Early Conflict Detection For Multi-writer (#6002) YueZhang 2022-06-29 16:43:31 +08:00
  • e3eb14ad2d [HUDI-4334] close SparkRDDWriteClient after usage in Create/Delete/RollbackSavepointsProcedure (#5994) Teng 2022-06-29 06:13:29 +08:00
  • fd7d25ab63 [HUDI-1176] Upgrade hudi to log4j2 (#5366) bschell 2022-06-28 14:54:23 -05:00
  • ed823f1c6f [HUDI-4320] Make sure HoodieStorageConfig.PARQUET_WRITE_LEGACY_FORMAT_ENABLED could be specified by the writer (#5970) Alexey Kudinkin 2022-06-28 12:27:32 -07:00
  • efb9719018 [HUDI-4332] The current instant may be wrong under some extreme conditions in AppendWriteFunction. (#5988) BruceLin 2022-06-28 20:42:26 +08:00
  • 08eba914ed [HUDI-4333] fix HoodieFileIndex's listFiles method log print skipping percent NaN (#5990) ForwardXu 2022-06-28 15:08:48 +08:00
  • 09dc001430 [HUDI-4325] fix spark sql procedure cause ParseException with semicolon (#5982) KnightChess 2022-06-28 09:44:41 +08:00
  • b14ed47f21 [HUDI-3506] Add call procedure for CommitsCommand (#5974) superche 2022-06-28 09:43:36 +08:00
  • 8846849a03 [HUDI-4291] Fix flaky TestCleanPlanExecutor#testKeepLatestFileVersions (#5930) Sagar Sumit 2022-06-27 17:27:16 +05:30
  • 3a1fd22841 [HUDI-4311] Fix Flink lose data on some rollback scene (#5950) 吴祥平 2022-06-27 16:09:44 +08:00
  • 26c967bac6 [HUDI-3504] Support bootstrap command based on Call Produce Command (#5977) ForwardXu 2022-06-27 13:06:50 +08:00
  • 8f4e2a189e [HUDI-4315] Do not throw exception in BaseSpark3Adapter#toTableIdentifier (#5957) leesf 2022-06-27 12:50:58 +08:00
  • 72fa19bcc9 [HUDI-4316] Support for spillable diskmap configuration when constructing HoodieMergedLogRecordScanner (#5959) cxzl25 2022-06-27 11:09:30 +08:00
  • 7a6eb0f6e1 [HUDI-4309] Spark3.2 custom parser should not throw exception (#5947) cxzl25 2022-06-27 09:37:23 +08:00
  • 0a9e568ff5 [HUDI-5246] Bumping mysql connector version due to security vulnerability (#5851) Sivabalan Narayanan 2022-06-26 16:54:57 -07:00
  • 559b26fb7c [MINOR] Remove -T option from CI build (#5972) Shiyan Xu 2022-06-26 10:34:05 -05:00
  • 1c43c590ac [HUDI-3502] Support hdfs parquet import command based on Call Produce Command (#5956) ForwardXu 2022-06-26 11:27:14 +08:00
  • 142adf4ccb [HUDI-4296] Fix the bug that TestHoodieSparkSqlWriter.testSchemaEvolutionForTableType is flaky (#5973) xiarixiaoyao 2022-06-25 21:03:19 +08:00
  • c86edfc28e [HUDI-4319] Fixed Parquet's PLAIN_DICTIONARY encoding not being applied when bulk-inserting (#5966) Alexey Kudinkin 2022-06-24 20:52:28 -07:00
  • 360df576a9 Revert "[TEST][DO_NOT_MERGE]fix random failed for ci (#5948)" (#5971) xiarixiaoyao 2022-06-25 11:23:17 +08:00
  • e8fbd4daf4 [TEST][DO_NOT_MERGE]fix random failed for ci (#5948) xiarixiaoyao 2022-06-25 10:15:08 +08:00
  • eeafaeacd2 [HUDI-3512] Add call procedure for StatsCommand (#5955) jiz 2022-06-25 09:43:23 +08:00
  • 59978ef4a9 [HUDI-4260] Change KEYGEN_CLASS_NAME without default value (#5877) luokey 2022-06-24 15:05:03 +08:00
  • 30ebdc708b [HUDI-3735] TestHoodieSparkMergeOnReadTableRollback is flaky (#5874) xi chaomin 2022-06-24 14:47:36 +08:00
  • 6456bd3a51 [HUDI-4273] Support inline schedule clustering for Flink stream (#5890) Zhaojing Yu 2022-06-24 11:28:06 +08:00
  • af9f09047d [HUDI-3509] Add call procedure for HoodieLogFileCommand (#5949) jiz 2022-06-24 10:16:54 +08:00
  • eeb78f23e6 [HUDI-4290] Fix fetchLatestBaseFiles to filter replaced filegroups (#5941) Sagar Sumit 2022-06-23 19:40:08 +05:30
  • 38ff18a199 [HUDI-4299] Fix problem about hudi-example-java run failed on idea. (#5936) Forus 2022-06-23 21:46:22 +08:00
  • 1bb017d396 [HUDI-3508] Add call procedure for FileSystemViewCommand (#5929) jiz 2022-06-22 17:50:20 +08:00
  • 1dbd9d407a [minor] following 4270, add unit tests for the keys lost case (#5918) Danny Chan 2022-06-22 16:56:06 +08:00
  • c9590790f8 [HUDI-4279] Strength the remote fs view lagging check when latest commit refresh is enabled (#5917) LinMingQiang 2022-06-22 10:32:21 +08:00
  • c7e430bb46 Revert master (#5925) Zhaojing Yu 2022-06-21 16:58:50 +08:00
  • 092e35c1e3 udate 喻兆靖 2022-06-21 15:22:04 +08:00
  • 4640a3bbb8 [HUDI-3475] Initialize hudi table management module. 喻兆靖 2022-06-08 09:54:31 +08:00
  • 7c4aaa9715 [HUDI-4270] Bootstrap op data loading missing (#5888) Bo Cui 2022-06-21 11:47:39 +08:00
  • 5c204f1416 [HUDI-4177] Fix hudi-cli rollback with rollbackUsingMarkers method call (#5734) Shawn Chang 2022-06-20 19:54:12 -07:00
  • ba4d5bd847 [HUDI-4251] Fix the problem that the command 'commits sync' description does not match. (#5881) Forus 2022-06-21 07:03:58 +08:00
  • 17ac5a4573 [HUDI-4173] Fix wrong results if the user read no base files hudi table by glob paths (#5723) RexAn 2022-06-21 01:32:34 +08:00
  • 7601e9e4c7 [MINOR] Update DOAP with 0.11.1 Release (#5908) Y Ethan Guo 2022-06-20 09:27:35 -07:00
  • f1103281d2 [HUDI-4258] Fix when HoodieTable removes data file before the end of Flink job (#5876) Alexander Trushev 2022-06-20 16:07:49 +07:00
  • 7c6bedff25 [HUDI-4259] Flink create avro schema not conformance to standards (#5878) luokey 2022-06-20 15:41:23 +08:00
  • d7facb8cb8 fix remove redundant Variable (#5806) felixYyu 2022-06-20 15:21:49 +08:00
  • 7481eacf23 [HUDI-4277] supoort flink table source with computed column (#5897) Shizhi Chen 2022-06-20 15:19:32 +08:00
  • efafb79eeb [MINOR] Add "spillable_map_path" in FlinkCompactionConfig. To avoid the disk space of "/tmp" full when compacting offline. (#5905) 5herhom 2022-06-20 15:15:23 +08:00
  • d4f0326b4b [HUDI-4275] Refactor rollback inflight instant for clustering/compaction to reuse some code (#5894) huberylee 2022-06-20 14:29:21 +08:00
  • c5c4cfec91 [HUDI-3507] Support export command based on Call Produce Command (#5901) ForwardXu 2022-06-19 18:48:22 +08:00
  • fec49dc12b [HUDI-4165] Support Create/Drop/Show/Refresh Index Syntax for Spark SQL (#5761) huberylee 2022-06-17 18:33:58 +08:00
  • 7689e62cd9 [HUDI-4265] Deprecate useless targetTableName parameter in HoodieMultiTableDeltaStreamer (#5883) 董可伦 2022-06-17 16:57:14 +08:00
  • 0ff34b6974 [HUDI-4214] improve repeat init write schema in ExpressionPayload (#5820) KnightChess 2022-06-16 17:58:37 +08:00
  • 2bf0a1906d [HUDI-4217] improve repeat init object in ExpressionPayload (#5825) KnightChess 2022-06-15 20:21:28 +08:00
  • c291b05699 [HUDI-4218] [HUDI-4218] Expose the real exception information when an exception occurs in the tableExists method (#5827) 董可伦 2022-06-15 18:10:35 +08:00
  • 7b946cf351 [HUDI-3499] Add Call Procedure for show rollbacks (#5848) superche 2022-06-15 16:50:15 +08:00
  • 0811bb38fb [HUDI-4255] Make the flink merge and replace handle intermediate file visible (#5866) Danny Chan 2022-06-15 14:23:23 +08:00
  • 25bbff64cf [minor] Following HUDI-4207, remote the new wrapper #init method (#5865) Danny Chan 2022-06-15 08:48:13 +08:00
  • f16b1e8982 [MINOR] Fix typo of DisruptorExecutor in RFC 53 (#5860) felixYyu 2022-06-14 14:30:17 +08:00
  • 264b15df87 [HUDI-4207] HoodieFlinkWriteClient.getOrCreateWriteHandle throws an e… (#5788) HunterXHunter 2022-06-13 22:36:06 +08:00
  • 4774c4248f [HUDI-4006] failOnDataLoss on delta-streamer kafka sources (#5718) Qi Ji 2022-06-13 22:31:57 +08:00
  • 0d859fe58b [HUDI-3863] Add UT for drop partition column in deltastreamer testsuite (#5727) luoyajun 2022-06-13 22:29:32 +08:00
  • e89f5627e4 [HUDI-3682] testReaderFilterRowKeys fails in TestHoodieOrcReaderWriter (#5790) xi chaomin 2022-06-13 22:22:12 +08:00
  • 14d8735a1c Strip extra spaces when creating new configuration (#5849) superche 2022-06-13 19:10:38 +08:00
  • c82e3462e3 [MINOR] fix AvroSchemaConverter duplicate branch in 'switch' (#5813) sandyfog 2022-06-13 10:55:24 +08:00
  • 5aaac21d1d [HUDI-4224] Fix CI issues (#5842) Shiyan Xu 2022-06-12 11:44:18 -07:00
  • fd8f7c5f6c [HUDI-4205] Fix NullPointerException in HFile reader creation (#5841) Y Ethan Guo 2022-06-11 14:46:43 -07:00
  • 97ccf5dd18 [HUDI-4223] Fix NullPointerException from getLogRecordScanner when reading metadata table (#5840) Y Ethan Guo 2022-06-11 13:19:24 -07:00
  • 08fe281091 [HUDI-4221] Fixing getAllPartitionPaths perf hit w/ FileSystemBackedMetadata (#5829) Sivabalan Narayanan 2022-06-11 16:17:42 -04:00
  • 2b3a85528a [HUDI-3889] Do not validate table config if save mode is set to Overwrite (#5619) xi chaomin 2022-06-10 07:23:51 +08:00
  • ba47904fa2 [HUDI-4139]improvement for flink write operator name to identify tables easily (#5744) yanenze 2022-06-10 05:48:20 +08:00
  • c608dbd6c2 [HUDI-4213] Infer keygen clazz for Spark SQL (#5815) Danny Chan 2022-06-09 20:37:58 +08:00
  • 8ff17b0470 [MINOR] FlinkStateBackendConverter add more exception message (#5809) sandyfog 2022-06-09 15:13:27 +08:00
  • f5ab921300 [MINOR][DOCS] Update the README.md file in hudi-examples (#5803) liuzhuang2017 2022-06-09 08:45:00 +08:00
  • 35afdb4316 [HUDI-4178] Addressing performance regressions in Spark DataSourceV2 Integration (#5737) Alexey Kudinkin 2022-06-07 16:30:46 -07:00
  • 1349b596a1 [HUDI-4198] Fix hive config for AWSGlueClientFactory (#5768) Raymond Xu 2022-06-07 07:51:31 -07:00
  • f85cd9b16d [HUDI-4200] Fixing sorting of keys fetched from metadata table (#5773) Sivabalan Narayanan 2022-06-07 08:19:52 -04:00
  • 4f5cad8029 [MINOR][RFC-53] Fix typos (#5764) YueZhang 2022-06-07 08:28:28 +08:00
  • e5710a8e7c [MINOR] Mark AWSGlueCatalogSyncClient experimental (#5775) Raymond Xu 2022-06-06 17:25:59 -07:00