Sivabalan Narayanan
c9fa3cffaf
[HUDI-1774] Adding support for delete_partitions to spark data source (#3437)
2021-08-11 01:03:01 -04:00
Shawy Geng
a5e496fe23
[HUDI-2292] MOR should not predicate pushdown when reading with payload_combine type (#3443)
2021-08-11 12:17:39 +08:00
swuferhong
5448cdde7e
[HUDI-2170] [HUDI-1763] Always choose the latest record for HoodieRecordPayload (#3401)
2021-08-11 10:20:55 +08:00
Sivabalan Narayanan
1196736185
[HUDI-1129] Improving schema evolution support in hudi (#2927)
* Adding support to ingest records with old schema after table's schema is evolved
* Rebasing against latest master
- Trimming test file to be < 800 lines
- Renaming config names
* Addressing feedback
Co-authored-by: Vinoth Chandar <vinoth@apache.org>
2021-08-10 09:15:37 -07:00
zhangyue19921010
73d898322b
[MINOR] Fix travis from errors (#3432)
2021-08-10 08:25:49 -07:00
pengzhiwei
41a9986a76
[HUDI-2208] Support Bulk Insert For Spark Sql (#3328)
2021-08-09 00:18:31 -04:00
pengzhiwei
32a50d8ddb
[HUDI-2243] Support Time Travel Query For Hoodie Table (#3360)
2021-08-07 19:07:22 -04:00
pengzhiwei
55d2e786db
[HUDI-1842] Spark Sql Support For pre-existing Hoodie Table (#3393)
2021-08-07 07:49:26 -04:00
pengzhiwei
3f8ca1a355
[HUDI-2182] Support Compaction Command For Spark Sql (#3277)
2021-08-06 15:12:10 +08:00
Sivabalan Narayanan
1df5ded433
[HUDI-2273] Migrating some long running tests to functional test profile (#3398)
2021-08-04 19:08:50 -04:00
pengzhiwei
5574e092fb
[HUDI-2232] [SQL] MERGE INTO fails with table having nested struct (#3379)
2021-08-04 18:20:29 +08:00
wenningd
91bb0d1318
[HUDI-2255] Refactor Datasource options (#3373)
Co-authored-by: Wenning Ding <wenningd@amazon.com>
2021-08-03 17:50:30 -07:00
Udit Mehrotra
1ff2d3459a
[HUDI-1371] [HUDI-1893] Support metadata based listing for Spark DataSource and Spark SQL (#2893)
2021-08-03 14:47:40 -07:00
vinoth chandar
b21ae68e67
[MINOR] Improving runtime of TestStructuredStreaming by 2 mins (#3382)
2021-08-02 13:42:46 -07:00
Sivabalan Narayanan
fe508376fa
[HUDI-2177][HUDI-2200] Adding virtual keys support for MOR table (#3315)
2021-08-02 09:45:09 -04:00
pengzhiwei
c2370402ea
[HUDI-2251] Fix Exception Cause By Table Name Case Sensitivity For Append Mode Write (#3367)
2021-07-29 17:36:56 -04:00
pengzhiwei
f109c6cb0d
[MINOR] fix check style error (#3365)
2021-07-29 17:29:10 +08:00
pengzhiwei
bbadac7de1
[HUDI-1425] Performance loss with the additional hoodieRecords.isEmpty() in HoodieSparkSqlWriter#write (#2296)
2021-07-28 21:30:18 -07:00
pengzhiwei
59ff8423f9
[HUDI-2223] Fix Alter Partitioned Table Failed (#3350)
2021-07-27 20:01:04 +08:00
Gary Li
925873bb3c
[HUDI-2217] Fix no value present in incremental query on MOR (#3340)
2021-07-27 17:30:01 +08:00
Sivabalan Narayanan
61148c1c43
[HUDI-2176, 2178, 2179] Adding virtual key support to COW table (#3306)
2021-07-26 17:21:04 -04:00
xiarixiaoyao
5353243449
[HUDI-2214] residual temporary files after clustering are not cleaned up (#3335)
2021-07-26 10:26:20 -07:00
pengzhiwei
2c910ee3af
[HUDI-2212] Missing PrimaryKey In Hoodie Properties For CTAS Table (#3332)
2021-07-23 15:21:57 +08:00
pengzhiwei
5a2f3d439e
[HUDI-2139] MergeInto MOR Table May Result InCorrect Result (#3230)
2021-07-23 10:19:43 +08:00
Sivabalan Narayanan
d5026e9a24
[HUDI-2161] Adding support to disable meta columns with bulk insert operation (#3247)
2021-07-19 20:43:48 -04:00
pengzhiwei
572a214412
[HUDI-1884] MergeInto Support Partial Update For COW (#3154)
2021-07-17 12:59:18 +08:00
Jintao Guan
38cd74b563
[MINOR] Allow users to choose ORC as base file format in Spark SQL (#3279)
2021-07-16 12:24:41 +08:00
pengzhiwei
f0a2f378ea
Merge pull request #3120 from pengzhiwei2018/dev_metasync
[HUDI-2045] Support Read Hoodie As DataSource Table For Flink And DeltaStreamer
2021-07-13 22:37:20 +08:00
pengzhiwei
ca440ccf88
[HUDI-2107] Support Read Log Only MOR Table For Spark (#3193)
2021-07-12 17:31:23 +08:00
pengzhiwei
ffa934182a
[HUDI-2045] Support Read Hoodie As DataSource Table For Flink And DeltaStreamer
2021-07-12 13:03:14 +08:00
Sagar Sumit
5804ad8e32
[HUDI-1483] Support async clustering for deltastreamer and Spark streaming (#3142)
- Integrate async clustering service with HoodieDeltaStreamer and HoodieStreamingSink
- Added methods in HoodieAsyncService to reuse code
2021-07-11 14:43:38 -04:00
Sivabalan Narayanan
16e90d30ea
[HUDI-1105] Adding dedup support for Bulk Insert w/ Rows (#2206)
2021-07-07 17:38:26 -04:00
Sivabalan Narayanan
ea9e5d0e8b
[HUDI-1104] Adding support for UserDefinedPartitioners and SortModes to BulkInsert with Rows (#3149)
2021-07-07 11:15:25 -04:00
xiarixiaoyao
a0f598d371
[HUDI-2089] fix the bug that metatable cannot support non_partition table (#3182)
2021-07-06 11:14:05 +08:00
pengzhiwei
287d2dd79c
[HUDI-2131] Exception Throw Out When MergeInto With Decimal Type Field (#3224)
2021-07-05 22:28:57 +08:00
xiarixiaoyao
2cecb75187
[HUDI-2058] support incremental query for insert_overwrite_table/insert_overwrite operation on cow table (#3139)
2021-07-05 18:54:05 +08:00
pengzhiwei
4f215e2938
[HUDI-2057] CTAS Generate An External Table When Create Managed Table (#3146)
2021-07-03 15:55:36 +08:00
pengzhiwei
70d9c2e747
[HUDI-2123] Exception When Merge With Null-Value Field (#3214)
2021-07-02 22:46:52 +08:00
pengzhiwei
ac65189458
[HUDI-2114] Spark Query MOR Table Written By Flink Return Incorrect Timestamp Value (#3208)
2021-07-02 17:39:57 +08:00
pengzhiwei
6403547431
[HUDI-2051] Enable Hive Sync When Spark Enable Hive Meta For Spark Sql (#3126)
2021-07-02 01:08:36 -07:00
pengzhiwei
6eca06d074
[HUDI-2105] Compaction Failed For MergeInto MOR Table (#3190)
2021-07-01 23:40:14 +08:00
wenningd
d412fb2fe6
[HUDI-89] Add configOption & refactor all configs based on that (#2833)
Co-authored-by: Wenning Ding <wenningd@amazon.com>
2021-06-30 14:26:30 -07:00
pengzhiwei
84dd3ca18b
[HUDI-2053] Insert Static Partition With DateType Return Incorrect Partition Value (#3133)
2021-06-24 19:09:37 +08:00
pengzhiwei
69c0d9e2d0
[HUDI-1883] Support Truncate Table For Hoodie (#3098)
2021-06-22 22:33:20 +08:00
pengzhiwei
4fd8a88b7e
[HUDI-1776] Support AlterCommand For Hoodie (#3086)
2021-06-21 22:58:43 +08:00
pengzhiwei
b9e28e5292
[HUDI-2033] ClassCastException Throw When PreCombineField Is String Type (#3099)
2021-06-17 23:21:20 +08:00
pengzhiwei
ad53cf450e
[HUDI-1879] Fix RO Tables Returning Snapshot Result (#2925)
2021-06-17 04:18:21 -07:00
Sivabalan Narayanan
7d9f9d7d82
[HUDI-1991] Fixing drop dups exception in bulk insert row writer path (#3055)
2021-06-14 09:55:52 +08:00
pengzhiwei
f760ec543e
[HUDI-1659] Basic Implement Of Spark Sql Support For Hoodie (#2645)
Main functions:
Support create table for hoodie.
Support CTAS.
Support Insert for hoodie, including dynamic partition and static partition insert.
Support MergeInto for hoodie.
Support DELETE
Support UPDATE
Supports both spark2 & spark3, based on DataSourceV1.
Main changes:
Add sql parser for spark2.
Add HoodieAnalysis for sql resolution and logical plan rewriting.
Add command implementations for CREATE TABLE, INSERT, MERGE INTO & CTAS.
In order to push down the update & insert logic to the HoodieRecordPayload for MergeInto, I made some changes to the
HoodieWriteHandler and other related classes.
1. Add the inputSchema for parsing the incoming record. This is because the inputSchema for MergeInto is different from the writeSchema, as there are some transforms in the update & insert expressions.
2. Add WRITE_SCHEMA to HoodieWriteConfig to pass the write schema for merge into.
3. Pass properties to HoodieRecordPayload#getInsertValue to pass the insert expression and table schema.
Verify this pull request
Add TestCreateTable to test creating hoodie tables and CTAS.
Add TestInsertTable to test inserting into hoodie tables.
Add TestMergeIntoTable to test merging into hoodie tables.
Add TestUpdateTable to test updating hoodie tables.
Add TestDeleteTable to test deleting from hoodie tables.
Add TestSqlStatement to test the currently supported ddl/dml.
2021-06-07 23:24:32 -07:00
pengzhiwei
dcd7c331dc
[HUDI-1879] Support Partition Prune For MergeOnRead Snapshot Table (#2926)
2021-05-29 07:50:24 -07:00