lanyuanxiaoyao/hudi - hudi - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
liujinhui	b7da6cb33d	[HUDI-2307] When using delete_partition with ds should not rely on the primary key (#3469 ) - Co-authored-by: Sivabalan Narayanan <n.siva.b@gmail.com>	2021-08-14 02:53:39 -04:00
pengzhiwei	f760ec543e	[HUDI-1659] Basic Implement Of Spark Sql Support For Hoodie (#2645 ) Main functions: Support create table for hoodie. Support CTAS. Support Insert for hoodie. Including dynamic partition and static partition insert. Support MergeInto for hoodie. Support DELETE Support UPDATE Both support spark2 & spark3 based on DataSourceV1. Main changes: Add sql parser for spark2. Add HoodieAnalysis for sql resolve and logical plan rewrite. Add commands implementation for CREATE TABLE、INSERT、MERGE INTO & CTAS. In order to push down the update&insert logical to the HoodieRecordPayload for MergeInto, I make same change to the HoodieWriteHandler and other related classes. 1、Add the inputSchema for parser the incoming record. This is because the inputSchema for MergeInto is different from writeSchema as there are some transforms in the update& insert expression. 2、Add WRITE_SCHEMA to HoodieWriteConfig to pass the write schema for merge into. 3、Pass properties to HoodieRecordPayload#getInsertValue to pass the insert expression and table schema. Verify this pull request Add TestCreateTable for test create hoodie tables and CTAS. Add TestInsertTable for test insert hoodie tables. Add TestMergeIntoTable for test merge hoodie tables. Add TestUpdateTable for test update hoodie tables. Add TestDeleteTable for test delete hoodie tables. Add TestSqlStatement for test supported ddl/dml currently.	2021-06-07 23:24:32 -07:00
Danny Chan	8b774fe331	[HUDI-1495] Bump Flink version to 1.12.2 (#2718 )	2021-03-26 14:25:57 +08:00
Danny Chan	89003bc780	[HUDI-1647] Supports snapshot read for Flink (#2613 )	2021-03-05 08:49:32 +08:00
Shen Hong	236d1b0dec	[HUDI-1439] Remove scala dependency from hudi-client-common (#2306 )	2020-12-11 00:36:37 -08:00
Mathieu	1f7add9291	[HUDI-1089] Refactor hudi-client to support multi-engine (#1827 ) - This change breaks `hudi-client` into `hudi-client-common` and `hudi-spark-client` modules - Simple usages of Spark using jsc.parallelize() has been redone using EngineContext#map, EngineContext#flatMap etc - Code changes in the PR, break classes into `BaseXYZ` parent classes with no spark dependencies living in `hudi-client-common` - Classes on `hudi-spark-client` are named `SparkXYZ` extending the parent classes with all the Spark dependencies - To simplify/cleanup, HoodieIndex#fetchRecordLocation has been removed and its usages in tests replaced with alternatives Co-authored-by: Vinoth Chandar <vinoth@apache.org>	2020-10-01 14:25:29 -07:00
Sivabalan Narayanan	379cf0786f	[HUDI-1013] Adding Bulk Insert V2 implementation (#1834 ) - Adding ability to use native spark row writing for bulk_insert - Controlled by `ENABLE_ROW_WRITER_OPT_KEY` datasource write option - Introduced KeyGeneratorInterface in hudi-client, moved KeyGenerator back to hudi-spark - Simplified the new API additions to just two new methods : getRecordKey(row), getPartitionPath(row) - Fixed all built-in key generators with new APIs - Made the field position map lazily created upon the first call to row based apis - Implemented native row based key generators for CustomKeyGenerator - Fixed all the tests, with these new APIs Co-authored-by: Balaji Varadarajan <varadarb@uber.com> Co-authored-by: Vinoth Chandar <vinoth@apache.org>	2020-08-13 00:33:39 -07:00
Raymond Xu	3b9a30528b	[HUDI-996] Add functional test suite for hudi-utilities (#1746 ) - Share resources for functional tests - Add suite for functional test classes from hudi-utilities	2020-07-05 16:44:31 -07:00
Raymond Xu	366bb10d8c	[HUDI-812] Migrate hudi common tests to JUnit 5 (#1590 ) * [HUDI-812] Migrate hudi-common tests to JUnit 5	2020-05-06 19:15:20 +08:00
Suneel Marthi	8c3001363d	HUDI-479: Eliminate or Minimize use of Guava if possible (#1159 )	2020-03-28 03:11:32 -04:00
lamber-ken	4b1b3fc28c	[MINOR] Set info servity for ImportOrder temporarily (#1127 ) - Now we need fix import check error manually, disable the rule temporarily before finding a better solution.	2019-12-24 19:07:04 +08:00
lamber-ken	313fab5fd1	[HUDI-444] Refactor the codes based on scala codestyle ReturnChecker rule (#1121 )	2019-12-24 07:05:54 +08:00
lamber-ken	b284091783	[HUDI-386] Refactor hudi scala checkstyle rules (#1099 )	2019-12-22 07:30:07 +08:00
lamber-ken	2745b7552f	[HUDI-379] Refactor the codes based on new JavadocStyle code style rule (#1079 )	2019-12-06 12:59:28 +08:00
lamber-ken	c06d89b648	[HUDI-378] Refactor the rest codes based on new ImportOrder code style rule (#1078 )	2019-12-05 17:25:03 +08:00
lamber-ken	b3e0ebbc4a	[checkstyle] Add ConstantName java checkstyle rule (#1066 ) * add SimplifyBooleanExpression java checkstyle rule * collapse empty tags in scalastyle file	2019-12-04 18:59:15 +08:00
谢磊	b77fad39b5	[HUDI-364] Refactor hudi-hive based on new ImportOrder code style rule (#1048 ) [HUDI-364] Refactor hudi-hive based on new ImportOrder code style rule	2019-11-27 16:30:37 +08:00
谢磊	212282c8aa	[HUDI-358] Add Java-doc and importOrder checkstyle rule (#1043 ) - import groups are separated by one blank line - org.apache.hudi.* at the top location	2019-11-25 11:36:23 -08:00
谢磊	804e348d0e	[HUDI-346] Set allowMultipleEmptyLines to false for EmptyLineSeparator rule (#1025 )	2019-11-19 18:44:42 +08:00
lamber-ken	045fa87a3d	[HUDI-330] add EmptyStatement java checkstyle rule	2019-11-13 14:11:11 -08:00
Balaji Varadarajan	77f4e73615	[HUDI-121] Fix licensing issues found during RC voting by general incubator group	2019-10-16 02:09:02 -07:00
leesf	b19bed442d	[HUDI-296] Explore use of spotless to auto fix formatting errors (#945 ) - Add spotless format fixing to project - One time reformatting for conformity - Build fails for formatting changes and mvn spotless:apply autofixes them	2019-10-10 05:19:40 -07:00
vinoth chandar	7a973a6944	[HUDI-159] Redesigning bundles for lighter-weight integrations - Documented principles applied for redesign at packaging/README.md - No longer depends on incl commons-codec, commons-io, commons-pool, commons-dbcp, commons-lang, commons-logging, avro-mapred - Introduce new FileIOUtils & added checkstyle rule for illegal import of above - Parquet, Avro dependencies moved to provided scope to enable being picked up from Hive/Spark/Presto instead - Pickup jackson jars for Hive sync tool from HIVE_HOME & unbundling jackson everywhere - Remove hive-jdbc standalone jar from being bundled in Spark/Hive/Utilities bundles - 6.5x reduced number of classes across bundles	2019-09-11 11:08:27 -07:00
leesf	8b150a3c6b	[HUDI-230] Add missing Apache License in some files	2019-08-30 09:38:28 -07:00
vinoyang	8f5e7ad5d9	[HUDI-205] Let checkstyle ban Java and Guava Optional instead of using Option provided by Hudi (#834 )	2019-08-13 17:13:52 -07:00
arukavytsia	6946dd7557	General enhancements	2018-12-18 12:52:39 -08:00
Balaji Varadarajan	788e4f2d2e	CodeStyle formatting to conform to basic Checkstyle rules. The code-style rules follow google style with some changes: 1. Increase line length from 100 to 120 2. Disable JavaDoc related checkstyles as this needs more manual work. Both source and test code are checked for code-style	2018-03-30 11:09:40 -07:00

27 Commits