Commit Graph

  • d4c2974eae MINOR fix method use error (#3467) liujinhui 2021-08-13 19:59:51 +08:00
  • 6a4100bb91 [MINOR] Tweak change log more as FULL for flink streaming source (#3466) Danny Chan 2021-08-13 16:31:16 +08:00
  • 6602e55cd2 [HUDI-2279]Support column name matching for insert * and update set * in merge into (#3415) 董可伦 2021-08-13 14:10:07 +08:00
  • 0544d70d8f [MINOR] Deprecate older configs (#3464) Sagar Sumit 2021-08-13 09:01:04 +05:30
  • 76bc686a77 [HUDI-1292] Created a config to enable/disable syncing of metadata table. (#3427) Prashant Wason 2021-08-12 15:45:57 -07:00
  • b651336454 [HUDI-2294] Adding virtual keys support to deltastreamer (#3450) Sivabalan Narayanan 2021-08-12 08:02:39 -04:00
  • c0fc9cdaf3 MINOR (#3459) liujinhui 2021-08-12 18:19:05 +08:00
  • dc3cbb28e7 [MINOR] Correct TestKafkaSource class and comment (#3451) vinoyang 2021-08-12 09:11:00 +08:00
  • b3e430f24b [HUDI-2017] Add API to set a metric in the registry. (#3084) Prashant Wason 2021-08-11 16:47:16 -07:00
  • 9e8308527a [HUDI-1518] Remove the logic that delete replaced file when archive (#3310) zhangyue19921010 2021-08-12 01:54:44 +08:00
  • 4783176554 [HUDI-1138] Add timeline-server-based marker file strategy for improving marker-related latency (#3233) Y Ethan Guo 2021-08-11 08:48:13 -07:00
  • 29332498af [HUDI-2298] The HoodieMergedLogRecordScanner should set up the operation of the chosen record (#3456) Danny Chan 2021-08-11 22:55:43 +08:00
  • aa11989ead [HUDI-2286] Handle the case of failed deltacommit on the metadata table. (#3428) Prashant Wason 2021-08-11 07:39:48 -07:00
  • c9fa3cffaf [HUDI-1774] Adding support for delete_partitions to spark data source (#3437) Sivabalan Narayanan 2021-08-11 01:03:01 -04:00
  • a5e496fe23 [HUDI-2292] MOR should not predicate pushdown when reading with payload_combine type (#3443) Shawy Geng 2021-08-11 12:17:39 +08:00
  • 8255a86cb4 [HUDI-1939] remove joda time in hivesync module (#3430) Raymond Xu 2021-08-10 20:25:41 -07:00
  • 5448cdde7e [HUDI-2170] [HUDI-1763] Always choose the latest record for HoodieRecordPayload (#3401) swuferhong 2021-08-11 10:20:55 +08:00
  • d1b4aa59bf [HUDI-2042] Compare the field object directly in OverwriteWithLatestAvroPayload (#3108) Shawy Geng 2021-08-11 05:48:53 +08:00
  • abbc8328e6 [MINOR] Fix contribution link in PULL_REQUEST_TEMPLATE (#3425) Damon P. Cortesi 2021-08-10 13:01:45 -07:00
  • 0e1c592c69 [MINOR] Delete useless com.uber.hoodie.hadoop.hive.HoodieCombineHiveInputFormat (#3298) vinoyang 2021-08-11 03:05:31 +08:00
  • 1196736185 [HUDI-1129] Improving schema evolution support in hudi (#2927) Sivabalan Narayanan 2021-08-10 12:15:37 -04:00
  • 73d898322b [MINOR] Fix travis from errors (#3432) zhangyue19921010 2021-08-10 23:25:49 +08:00
  • a18bc839d1 [HUDI-2288] Support storage on ks3 for hudi (#3434) xuzifu666 2021-08-10 23:18:12 +08:00
  • 21db6d7a84 [HUDI-1771] Propagate CDC format for hoodie (#3285) swuferhong 2021-08-10 20:23:23 +08:00
  • b4441abcf7 [HUDI-2194] Skip the latest N partitions when choosing partitions to create ClusteringPlan (#3300) zhangyue19921010 2021-08-10 01:10:15 +08:00
  • 41a9986a76 [HUDI-2208] Support Bulk Insert For Spark Sql (#3328) pengzhiwei 2021-08-09 12:18:31 +08:00
  • 11ea74958d [HUDI-2247] Filter file where length less than parquet MAGIC length (#3363) yuzhaojing 2021-08-09 09:15:42 +08:00
  • 32a50d8ddb [HUDI-2243] Support Time Travel Query For Hoodie Table (#3360) pengzhiwei 2021-08-08 07:07:22 +08:00
  • 55d2e786db [HUDI-1842] Spark Sql Support For pre-existing Hoodie Table (#3393) pengzhiwei 2021-08-07 19:49:26 +08:00
  • 70b6bd485f [HUDI-1468] Support custom clustering strategies and preserve commit metadata as part of clustering (#3419) Sagar Sumit 2021-08-07 08:23:08 +05:30
  • 9ce548edb1 [MINOR] fix compile error in compaction command (#3421) pengzhiwei 2021-08-06 16:18:19 +08:00
  • 3f8ca1a355 [HUDI-2182] Support Compaction Command For Spark Sql (#3277) pengzhiwei 2021-08-06 15:12:10 +08:00
  • 20feb1a897 [HUDI-2278] Use INT64 timestamp with precision 3 for flink parquet writer (#3414) Danny Chan 2021-08-06 11:06:21 +08:00
  • b7586a5632 [HUDI-2274] Allows INSERT duplicates for Flink MOR table (#3403) Danny Chan 2021-08-06 10:30:52 +08:00
  • 0dcd6a8fca [HUDI-2233] Use HMS To Sync Hive Meta For Spark Sql (#3387) pengzhiwei 2021-08-05 21:57:22 +08:00
  • 1df5ded433 [HUDI-2273] Migrating some long running tests to functional test profile (#3398) Sivabalan Narayanan 2021-08-04 19:08:50 -04:00
  • 5574e092fb [HUDI-2232] [SQL] MERGE INTO fails with table having nested struct (#3379) pengzhiwei 2021-08-04 18:20:29 +08:00
  • b8b9d6db83 [HUDI-2087] Support Append only in Flink stream (#3390) yuzhaojing 2021-08-04 17:53:20 +08:00
  • 02331fc223 [HUDI-2258] Metadata table for flink (#3381) Danny Chan 2021-08-04 10:54:55 +08:00
  • b4c14eaa29 [HUDI-2090] Ensure Disk Maps create a subfolder with appropriate prefixes and cleans them up on close (#3329) rmahindra123 2021-08-03 17:51:25 -07:00
  • 91bb0d1318 [HUDI-2255] Refactor Datasource options (#3373) wenningd 2021-08-03 17:50:30 -07:00
  • 1ff2d3459a [HUDI-1371] [HUDI-1893] Support metadata based listing for Spark DataSource and Spark SQL (#2893) Udit Mehrotra 2021-08-03 14:47:40 -07:00
  • 245e1fd17d [HUDI-2272] Pass base file format to sync clients (#3397) rmahindra123 2021-08-03 14:46:02 -07:00
  • 826a04d142 [HUDI-2072] Add pre-commit validator framework (#3153) satishkotha 2021-08-03 12:07:45 -07:00
  • bec23bda50 [HUDI-2269] Release the disk map resource for flink streaming reader (#3384) Danny Chan 2021-08-03 13:55:35 +08:00
  • aa857beee0 [HUDI-2225] Add a compaction job in hudi-examples (#3347) Sagar Sumit 2021-08-03 09:01:56 +05:30
  • b21ae68e67 [MINOR] Improving runtime of TestStructuredStreaming by 2 mins (#3382) vinoth chandar 2021-08-02 13:42:46 -07:00
  • fe508376fa [HUDI-2177][HUDI-2200] Adding virtual keys support for MOR table (#3315) Sivabalan Narayanan 2021-08-02 09:45:09 -04:00
  • dde57b293c [HUDI-2164] Let users build cluster plan and execute this plan at once using HoodieClusteringJob for async clustering (#3259) zhangyue19921010 2021-08-02 08:07:59 +08:00
  • 6353fc865f [HUDI-2218] Fix missing HoodieWriteStat in HoodieCreateHandle (#3341) Gary Li 2021-07-30 17:36:57 +08:00
  • f7f5d4cc6d [HUDI-2184] Support setting hive sync partition extractor class based on flink configuration (#3284) swuferhong 2021-07-30 17:24:00 +08:00
  • c4e45a0010 [HUDI-2254] Builtin sort operator for flink bulk insert (#3372) Danny Chan 2021-07-30 16:58:11 +08:00
  • 8b19ec9ca0 [HUDI-2252] Default consumes from the latest instant for flink streaming reader (#3368) swuferhong 2021-07-30 14:25:05 +08:00
  • 7bdae69053 [HUDI-2253] Refactoring few tests to reduce runningtime. DeltaStreamer and MultiDeltaStreamer tests. Bulk insert row writer tests (#3371) Sivabalan Narayanan 2021-07-30 01:22:26 -04:00
  • c2370402ea [HUDI-2251] Fix Exception Cause By Table Name Case Sensitivity For Append Mode Write (#3367) pengzhiwei 2021-07-30 05:36:56 +08:00
  • 44e41dc9bb [HUDI-2117] Unpersist the input rdd after the commit is completed to … (#3207) Shawy Geng 2021-07-29 23:16:58 +08:00
  • f109c6cb0d [MINOR] fix check style error (#3365) pengzhiwei 2021-07-29 17:29:10 +08:00
  • bbadac7de1 [HUDI-1425] Performance loss with the additional hoodieRecords.isEmpty() in HoodieSparkSqlWriter#write (#2296) pengzhiwei 2021-07-29 12:30:18 +08:00
  • efbbb67420 [HUDI-2241] Explicit parallelism for flink bulk insert (#3357) Danny Chan 2021-07-29 09:57:37 +08:00
  • 7739518879 [HUDI-2228] Add option 'hive_sync.mode' for flink writer (#3352) swuferhong 2021-07-28 19:45:50 +08:00
  • eedfadeb46 [HUDI-2244] Fix database alreadyExists exception while hive sync (#3361) swuferhong 2021-07-28 19:40:16 +08:00
  • 91c2213412 [HUDI-2245] BucketAssigner generates the fileId evenly to avoid data skew (#3362) Danny Chan 2021-07-28 19:26:37 +08:00
  • 8105cf588e [HUDI-2230] Make codahale times transient to avoid serializable exceptions (#3345) davehagman 2021-07-28 02:45:09 -04:00
  • 8fef50e237 [HUDI-2044] Integrate consumers with rocksDB and compression within External Spillable Map (#3318) rmahindra123 2021-07-27 22:31:03 -07:00
  • 00cd35f90a [HUDI-2215] Add rateLimiter when Flink writes to hudi. (#3338) mincwang 2021-07-28 08:23:23 +08:00
  • 60758b36ea [HUDI-2227] Only sync hive meta on successful commit for flink batch writer (#3351) Danny Chan 2021-07-27 20:10:08 +08:00
  • 59ff8423f9 [HUDI-2223] Fix Alter Partitioned Table Failed (#3350) pengzhiwei 2021-07-27 20:01:04 +08:00
  • 925873bb3c [HUDI-2217] Fix no value present in incremental query on MOR (#3340) Gary Li 2021-07-27 17:30:01 +08:00
  • ab2e0d0ba2 [HUDI-2219] Fix NPE of HoodieConfig (#3342) Danny Chan 2021-07-27 15:18:05 +08:00
  • 9d2a65a6a6 [HUDI-2209] Bulk insert for flink writer (#3334) Danny Chan 2021-07-27 10:58:23 +08:00
  • 024cf01f02 [MINOR] Correct the words accroding in the comments to according (#3343) xiang2102 2021-07-27 08:48:58 +08:00
  • 61148c1c43 [HUDI-2176, 2178, 2179] Adding virtual key support to COW table (#3306) Sivabalan Narayanan 2021-07-26 17:21:04 -04:00
  • 5353243449 [HUDI-2214]residual temporary files after clustering are not cleaned up (#3335) xiarixiaoyao 2021-07-27 01:26:20 +08:00
  • a5638b995b [MINOR] Close log scanner after compaction completed (#3294) Gary Li 2021-07-26 17:39:13 +08:00
  • a91296f14a [HUDI-2216] Correct the words fiels in the comments to fields (#3339) 董可伦 2021-07-25 12:15:57 +08:00
  • a14b19fdd5 [HUDI-1241] Automate the generation of configs webpage as configs are added to Hudi repo (#3302) rmahindra123 2021-07-23 21:33:34 -07:00
  • b2f7fcb8c8 [MINOR] Replace deprecated method isDir with isDirectory (#3319) Xuedong Luan 2021-07-24 10:02:24 +08:00
  • 66207ed91a [HUDI-1848] Adding support for HMS for running DDL queries in hive-sy… (#2879) jsbali 2021-07-23 21:33:15 +05:30
  • 71e14cf866 [HUDI-2213] Remove unnecessary parameter for HoodieMetrics constructor and fix NPE in UT (#3333) Xuedong Luan 2021-07-23 19:57:35 +08:00
  • 2c910ee3af [HUDI-2212] Missing PrimaryKey In Hoodie Properties For CTAS Table (#3332) pengzhiwei 2021-07-23 15:21:57 +08:00
  • 6d592c5896 [HUDI-2211] Fix NullPointerException in TestHoodieConsoleMetrics (#3331) Xuedong Luan 2021-07-23 11:22:54 +08:00
  • 5a2f3d439e [HUDI-2139] MergeInto MOR Table May Result InCorrect Result (#3230) pengzhiwei 2021-07-23 10:19:43 +08:00
  • c89bf1de20 [HUDI-2205] Rollback inflight compaction for flink writer (#3320) Danny Chan 2021-07-22 22:56:51 +08:00
  • fe5d2e7f53 [HUDI-2206] Fix checkpoint blocked because getLastPendingInstant() action after than restoreWriteMetadata() action (#3326) swuferhong 2021-07-22 16:35:07 +08:00
  • 151f22e43a [HUDI-2195] Sync Hive Failed When Execute CTAS In Spark2 And Spark3 (#3299) pengzhiwei 2021-07-22 15:33:38 +08:00
  • 2370a9facb [HUDI-2204] Add marker files for flink writer (#3316) Danny Chan 2021-07-22 13:34:15 +08:00
  • 5a94b6bf54 [HUDI-2192] Clean up Multiple versions of scala libraries detected Warning (#3292) Vinay Patil 2021-07-21 13:03:27 +05:30
  • 4f1350f7c1 [MINOR] Disable codecov (#3314) satishkotha 2021-07-20 22:07:22 -07:00
  • d58a8348dc [HUDI-2007] Fixing hudi_test_suite for spark nodes and adding spark bulk_insert node (#3074) Sivabalan Narayanan 2021-07-21 00:11:01 -04:00
  • 858e84b5b2 [HUDI-2198] Clean and reset the bootstrap events for coordinator when task failover (#3304) Danny Chan 2021-07-21 10:13:05 +08:00
  • 634163a990 [HUDI-2145] Create new bucket when NewFileAssignState filled (#3258) yuzhaojing 2021-07-20 17:46:45 +08:00
  • a086d255c8 [HUDI-1860] Add INSERT_OVERWRITE and INSERT_OVERWRITE_TABLE support to DeltaStreamer (#3184) Samrat 2021-07-20 07:19:43 +05:30
  • d5026e9a24 [HUDI-2161] Adding support to disable meta columns with bulk insert operation (#3247) Sivabalan Narayanan 2021-07-19 20:43:48 -04:00
  • 2099bf41db [HUDI-2193] Remove state in BootstrapFunction 喻兆靖 2021-07-19 15:51:33 +08:00
  • 572a214412 [HUDI-1884] MergeInto Support Partial Update For COW (#3154) pengzhiwei 2021-07-17 12:59:18 +08:00
  • af837d2f18 [HUDI-1447] DeltaStreamer kafka source supports consuming from specified timestamp (#2438) liujinhui 2021-07-17 12:31:06 +08:00
  • 50c2b76d72 Revert "[HUDI-2087] Support Append only in Flink stream (#3252)" yuzhao.cyz 2021-07-16 18:05:33 +08:00
  • c8aaf00819 [HUDI-2185] Remove the default parallelism of index bootstrap and bucket assigner yuzhao.cyz 2021-07-16 14:22:53 +08:00
  • 3b264e80d9 [HUDI-1633] Make callback return HoodieWriteStat (#2445) liujinhui 2021-07-16 12:37:07 +08:00
  • 38cd74b563 [MINOR] Allow users to choose ORC as base file format in Spark SQL (#3279) Jintao Guan 2021-07-15 21:24:41 -07:00