pengzhiwei
59ff8423f9
[HUDI-2223] Fix Alter Partitioned Table Failed ( #3350 )
2021-07-27 20:01:04 +08:00
Gary Li
925873bb3c
[HUDI-2217] Fix no value present in incremental query on MOR ( #3340 )
2021-07-27 17:30:01 +08:00
Sivabalan Narayanan
61148c1c43
[HUDI-2176, 2178, 2179] Adding virtual key support to COW table ( #3306 )
2021-07-26 17:21:04 -04:00
xiarixiaoyao
5353243449
[HUDI-2214]residual temporary files after clustering are not cleaned up ( #3335 )
2021-07-26 10:26:20 -07:00
董可伦
a91296f14a
[HUDI-2216] Correct the words fiels in the comments to fields ( #3339 )
2021-07-25 12:15:57 +08:00
pengzhiwei
2c910ee3af
[HUDI-2212] Missing PrimaryKey In Hoodie Properties For CTAS Table ( #3332 )
2021-07-23 15:21:57 +08:00
pengzhiwei
5a2f3d439e
[HUDI-2139] MergeInto MOR Table May Result InCorrect Result ( #3230 )
2021-07-23 10:19:43 +08:00
pengzhiwei
151f22e43a
[HUDI-2195] Sync Hive Failed When Execute CTAS In Spark2 And Spark3 ( #3299 )
2021-07-22 15:33:38 +08:00
Vinay Patil
5a94b6bf54
[HUDI-2192] Clean up Multiple versions of scala libraries detected Warning ( #3292 )
2021-07-21 00:33:27 -07:00
Sivabalan Narayanan
d5026e9a24
[HUDI-2161] Adding support to disable meta columns with bulk insert operation ( #3247 )
2021-07-19 20:43:48 -04:00
pengzhiwei
572a214412
[HUDI-1884] MergeInto Support Partial Update For COW ( #3154 )
2021-07-17 12:59:18 +08:00
Jintao Guan
38cd74b563
[MINOR] Allow users to choose ORC as base file format in Spark SQL ( #3279 )
2021-07-16 12:24:41 +08:00
vinoth chandar
75040ee9e5
[HUDI-2149] Ensure and Audit docs for every configuration class in the codebase ( #3272 )
...
- Added docs when missing
- Rewrote, reworded as needed
- Made couple more classes extend HoodieConfig
2021-07-14 10:56:08 -07:00
pengzhiwei
f0a2f378ea
Merge pull request #3120 from pengzhiwei2018/dev_metasync
...
[HUDI-2045] Support Read Hoodie As DataSource Table For Flink And DeltaStreamer
2021-07-13 22:37:20 +08:00
pengzhiwei
ca440ccf88
[HUDI-2107] Support Read Log Only MOR Table For Spark ( #3193 )
2021-07-12 17:31:23 +08:00
pengzhiwei
ffa934182a
[HUDI-2045] Support Read Hoodie As DataSource Table For Flink And DeltaStreamer
2021-07-12 13:03:14 +08:00
Sagar Sumit
5804ad8e32
[HUDI-1483] Support async clustering for deltastreamer and Spark streaming ( #3142 )
...
- Integrate async clustering service with HoodieDeltaStreamer and HoodieStreamingSink
- Added methods in HoodieAsyncService to reuse code
2021-07-11 14:43:38 -04:00
Sivabalan Narayanan
8c0dbaa9b3
[HUDI-2009] Fixing extra commit metadata in row writer path ( #3075 )
2021-07-08 03:07:27 -04:00
Sivabalan Narayanan
16e90d30ea
[HUDI-1105] Adding dedup support for Bulk Insert w/ Rows ( #2206 )
2021-07-07 17:38:26 -04:00
Sebastian Bernauer
8f7ad8b178
[HUDI-2069] Refactored String constants ( #3172 )
2021-07-07 14:22:00 -04:00
Sivabalan Narayanan
ea9e5d0e8b
[HUDI-1104] Adding support for UserDefinedPartitioners and SortModes to BulkInsert with Rows ( #3149 )
2021-07-07 11:15:25 -04:00
xiarixiaoyao
a0f598d371
[HUDI-2089]fix the bug that metatable cannot support non_partition table ( #3182 )
2021-07-06 11:14:05 +08:00
pengzhiwei
287d2dd79c
[HUDI-2131] Exception Throw Out When MergeInto With Decimal Type Field ( #3224 )
2021-07-05 22:28:57 +08:00
xiarixiaoyao
2cecb75187
[HUDI-2058]support incremental query for insert_overwrite_table/insert_overwrite operation on cow table ( #3139 )
2021-07-05 18:54:05 +08:00
xiarixiaoyao
6a71412f78
[HUDI-2116] Support batch synchronization of partition datas to hive metastore to avoid oom problem ( #3209 )
2021-07-04 22:30:36 +08:00
pengzhiwei
4f215e2938
[HUDI-2057] CTAS Generate An External Table When Create Managed Table ( #3146 )
2021-07-03 15:55:36 +08:00
pengzhiwei
70d9c2e747
[HUDI-2123] Exception When Merge With Null-Value Field ( #3214 )
2021-07-02 22:46:52 +08:00
pengzhiwei
ac65189458
[HUDI-2114] Spark Query MOR Table Written By Flink Return Incorrect Timestamp Value ( #3208 )
2021-07-02 17:39:57 +08:00
pengzhiwei
6403547431
[HUDI-2051] Enable Hive Sync When Spark Enable Hive Meta For Spark Sql ( #3126 )
2021-07-02 01:08:36 -07:00
pengzhiwei
6eca06d074
[HUDI-2105] Compaction Failed For MergeInto MOR Table ( #3190 )
2021-07-01 23:40:14 +08:00
wenningd
d412fb2fe6
[HUDI-89] Add configOption & refactor all configs based on that ( #2833 )
...
Co-authored-by: Wenning Ding <wenningd@amazon.com >
2021-06-30 14:26:30 -07:00
Sebastian Bernauer
b32855545b
[HUDI-2069] Fix KafkaAvroSchemaDeserializer to not rely on reflection ( #3111 )
...
[HUDI-2069] KafkaAvroSchemaDeserializer should get sourceSchema passed instead using Reflection
2021-06-24 09:08:21 -04:00
pengzhiwei
84dd3ca18b
[HUDI-2053] Insert Static Partition With DateType Return Incorrect Partition Value ( #3133 )
2021-06-24 19:09:37 +08:00
pengzhiwei
7e50f9a5a6
[HUDI-2061] Incorrect Schema Inference For Schema Evolved Table ( #3137 )
2021-06-23 22:48:01 -07:00
pengzhiwei
69c0d9e2d0
[HUDI-1883] Support Truncate Table For Hoodie ( #3098 )
2021-06-22 22:33:20 +08:00
pengzhiwei
4fd8a88b7e
[HUDI-1776] Support AlterCommand For Hoodie ( #3086 )
2021-06-21 22:58:43 +08:00
pengzhiwei
b9e28e5292
[HUDI-2033] ClassCastException Throw When PreCombineField Is String Type ( #3099 )
2021-06-17 23:21:20 +08:00
pengzhiwei
ad53cf450e
[HUDI-1879] Fix RO Tables Returning Snapshot Result ( #2925 )
2021-06-17 04:18:21 -07:00
Jintao Guan
b8fe5b91d5
[HUDI-764] [HUDI-765] ORC reader writer Implementation ( #2999 )
...
Co-authored-by: Qingyun (Teresa) Kang <kteresa@uber.com >
2021-06-15 15:21:43 -07:00
Sivabalan Narayanan
7d9f9d7d82
[HUDI-1991] Fixing drop dups exception in bulk insert row writer path ( #3055 )
2021-06-14 09:55:52 +08:00
wangxianghu
7261f08507
[HUDI-1929] Support configure KeyGenerator by type ( #2993 )
2021-06-08 09:26:10 -04:00
pengzhiwei
f760ec543e
[HUDI-1659] Basic Implement Of Spark Sql Support For Hoodie ( #2645 )
...
Main functions:
Support create table for hoodie.
Support CTAS.
Support Insert for hoodie. Including dynamic partition and static partition insert.
Support MergeInto for hoodie.
Support DELETE
Support UPDATE
Both support spark2 & spark3 based on DataSourceV1.
Main changes:
Add sql parser for spark2.
Add HoodieAnalysis for sql resolve and logical plan rewrite.
Add commands implementation for CREATE TABLE、INSERT、MERGE INTO & CTAS.
In order to push down the update&insert logical to the HoodieRecordPayload for MergeInto, I make same change to the
HoodieWriteHandler and other related classes.
1、Add the inputSchema for parser the incoming record. This is because the inputSchema for MergeInto is different from writeSchema as there are some transforms in the update& insert expression.
2、Add WRITE_SCHEMA to HoodieWriteConfig to pass the write schema for merge into.
3、Pass properties to HoodieRecordPayload#getInsertValue to pass the insert expression and table schema.
Verify this pull request
Add TestCreateTable for test create hoodie tables and CTAS.
Add TestInsertTable for test insert hoodie tables.
Add TestMergeIntoTable for test merge hoodie tables.
Add TestUpdateTable for test update hoodie tables.
Add TestDeleteTable for test delete hoodie tables.
Add TestSqlStatement for test supported ddl/dml currently.
2021-06-07 23:24:32 -07:00
Vinay Patil
2a7e1e091e
[HUDI-1942] Add Default value for HIVE_AUTO_CREATE_DATABASE_OPT_KEY in HoodieSparkSqlWriter ( #3036 )
2021-06-05 18:02:26 -04:00
wangxianghu
870e97b5f8
[MINOR] Remove unused method in DataSourceUtils ( #3031 )
2021-06-03 10:24:51 -07:00
pengzhiwei
dcd7c331dc
[HUDI-1879] Support Partition Prune For MergeOnRead Snapshot Table ( #2926 )
2021-05-29 07:50:24 -07:00
leesf
112732db81
[HUDI-1922] Bulk insert with row writer supports mor table ( #2981 )
2021-05-25 09:40:22 -07:00
wangxianghu
e7020748b5
[HUDI-1920] Set archived as the default value of HOODIE_ARCHIVELOG_FOLDER_PROP_NAME ( #2978 )
2021-05-25 16:29:55 +08:00
mpouttu
369a849337
[HUDI-1873] collect() call causing issues with very large upserts ( #2907 )
...
Co-authored-by: Sivabalan Narayanan <sivabala@uber.com >
2021-05-24 01:29:01 -04:00
Sivabalan Narayanan
5d1f592395
[HUDI-1806] Honoring skipROSuffix in spark ds ( #2882 )
...
* Honoring skipROSuffix in spark ds
* Adding tests
* fixing scala checkstype issue
2021-05-18 16:11:39 -07:00
xoln ann
12443e4187
[HUDI-1446] Support skip bootstrapIndex's init in abstract fs view init ( #2520 )
...
Co-authored-by: zhongliang <zhongliang@kuaishou.com >
Co-authored-by: Sivabalan Narayanan <sivabala@uber.com >
2021-05-14 00:29:26 -04:00