lanyuanxiaoyao/hudi - hudi - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
vinoth chandar	9706f659db	[HUDI-508] Standardizing on "Table" instead of "Dataset" across code (#1197 ) - Docs were talking about storage types before, cWiki moved to "Table" - Most of code already has HoodieTable, HoodieTableMetaClient - correct naming - Replacing renaming use of dataset across code/comments - Few usages in comments and use of Spark SQL DataSet remain unscathed	2020-01-07 12:52:32 -08:00
lamber-ken	75c3f630d4	[HUDI-405] Remove HIVE_ASSUME_DATE_PARTITION_OPT_KEY config from DataSource	2020-01-06 14:25:38 -08:00
Pratyaksh Sharma	8f935e779a	[HUDI-406]: added default partition path in TimestampBasedKeyGenerator	2020-01-06 09:38:06 -08:00
hongdd	2d5b79d96f	[HUDI-438] Merge duplicated code fragment in HoodieSparkSqlWriter (#1114 )	2020-01-06 22:51:22 +08:00
Sivabalan Narayanan	7031445eb3	[HUDI-377] Adding Delete() support to DeltaStreamer (#1073 ) - Provides ability to perform hard deletes by writing delete marker records into the source data - if the record contains a special field _hoodie_delete_marker set to true, deletes are performed	2020-01-04 11:07:31 -08:00
Pratyaksh Sharma	dde21e7315	[HUDI-402]: code clean up in test cases	2019-12-31 11:10:49 -08:00
vinoth chandar	350b0ecb4d	[HUDI-311] : Support for AWS Database Migration Service in DeltaStreamer - Add a transformer class, that adds `Op` fiels if not found in input frame - Add a payload implementation, that issues deletes when Op=D - Remove Parquet as a top level source type, consolidate with RowSource - Made delta streamer work without a property file, simply using overridden cli options - Unit tests for transformer/payload classes	2019-12-23 20:56:55 -08:00
lamber-ken	313fab5fd1	[HUDI-444] Refactor the codes based on scala codestyle ReturnChecker rule (#1121 )	2019-12-24 07:05:54 +08:00
YanJia-Gary-Li	36b3b6f5dd	[HUDI-415] Get commit time when Spark start (#1113 )	2019-12-19 22:19:06 -08:00
lamber-ken	a405d3873b	[MINOR] replace scala map add operator (#1093 ) replace ++: with ++	2019-12-12 11:29:17 +08:00
lamber-ken	ba514cfea0	[MINOR] Remove redundant plus operator (#1097 )	2019-12-12 05:42:05 +08:00
lamber-ken	d447e2d751	[checkstyle] Unify LOG form (#1092 )	2019-12-10 19:23:38 +08:00
Wenning Ding	e555aa516d	[HUDI-353] Add hive style partitioning path	2019-12-09 12:29:53 -08:00
lamber-ken	2745b7552f	[HUDI-379] Refactor the codes based on new JavadocStyle code style rule (#1079 )	2019-12-06 12:59:28 +08:00
hongdd	b65a897856	[HUDI-374] Unable to generateUpdates in QuickstartUtils (#1059 )	2019-11-30 11:11:00 -08:00
lamber-ken	024230fbd2	[HUDI-372] Support the shortName for Hudi DataSource (#1054 ) - Ability to do `spark.write.format("hudi")...`	2019-11-30 08:02:33 -08:00
谢磊	f9139c0f61	[HUDI-366] Refactor some module codes based on new ImportOrder code style rule (#1055 ) [HUDI-366] Refactor hudi-hadoop-mr / hudi-timeline-service / hudi-spark / hudi-integ-test / hudi- utilities based on new ImportOrder code style rule	2019-11-27 21:32:43 +08:00
bschell	60fed21dc7	[HUDI-327] Add null/empty checks to key generators (#1040 ) * Adds null and empty checks to all key generators. * Also improves error messaging for key generator issues.	2019-11-26 02:37:16 -08:00
filippo balicchia	845a0509b3	[MINOR] Some minor optimizations in HoodieJavaStreamingApp (#1046 )	2019-11-25 18:49:13 +08:00
Sivabalan Narayanan	c3355109b1	[HUDI-328] Adding delete api to HoodieWriteClient (#1004 ) [HUDI-328] Adding delete api to HoodieWriteClient and Spark DataSource	2019-11-22 15:05:25 -08:00
hongdd	7bc08cbfdc	[HUDI-345] Fix used deprecated function (#1024 ) - Schema.parse() with new Schema.Parser().parse - FSDataOutputStream constructor	2019-11-22 03:32:09 -08:00
谢磊	804e348d0e	[HUDI-346] Set allowMultipleEmptyLines to false for EmptyLineSeparator rule (#1025 )	2019-11-19 18:44:42 +08:00
vinoth chandar	e4c91ed13f	[HUDI-290] Normalize test class name of all test classes (#951 )	2019-10-22 20:19:11 -07:00
Balaji Varadarajan	77f4e73615	[HUDI-121] Fix licensing issues found during RC voting by general incubator group	2019-10-16 02:09:02 -07:00
leesf	b19bed442d	[HUDI-296] Explore use of spotless to auto fix formatting errors (#945 ) - Add spotless format fixing to project - One time reformatting for conformity - Build fails for formatting changes and mvn spotless:apply autofixes them	2019-10-10 05:19:40 -07:00
Balaji Varadarajan	9b66ea41fd	[HUDI-121] Remove leftover notice file and replace com.uber.hoodie with org.apache.hudi in log4j properties	2019-10-04 09:18:57 -07:00
Balaji Varadarajan	6da2f9ac7c	[HUDI-287] Address comments during review of release candidate 1. Remove LICENSE and NOTICE files in hoodie child modules. 2. Remove developers and contributor section from pom 3. Also ensure any failures in validation script is reported appropriately 4. Make hoodie parent pom consistent with that of its parent apache-21 (https://github.com/apache/maven-apache-parent/blob/apache-21/pom.xml)	2019-10-03 09:00:07 -07:00
Balaji Varadarajan	6e8a28bcae	HUDI-121 : Address comments during RC2 voting 1. Remove dnl utils jar from git 2. Add LICENSE Headers in missing files 3. Fix NOTICE and LICENSE in all HUDI packages and in top-level 4. Fix License wording in certain HUDI source files 5. Include non java/scala code in RAT licensing check 6. Use whitelist to include dependencies as part of timeline-server bundling	2019-09-30 15:42:15 -07:00
Bhavani Sudha Saktheeswaran	50a073ff57	[HUDI-271] Create QuickstartUtils for simplifying quickstart guide - This will be used in Quickstart guide (Doc changes to follow in a seperate PR). The intention is to simplify quickstart to showcase hudi APIs by writing and reading using spark datasources. - This is located in hudi-spark module intentionally to bring all the necessary classes in hudi-spark-bundle finally.	2019-09-30 15:22:18 -07:00
Vinoth Chandar	e217db56ab	[HUDI-254]: Bundle and shade databricks/avro with spark bundle - spark 2.4 onwards, spark has built in support. shading to avoid conflicts - spark 2.3 still needs this bundled, so that dropping bundle into jars folder would work	2019-09-17 12:38:51 -07:00
Balaji Varadarajan	c1e7d0e5a6	[HUDI-121] Update Release notes and fix master version	2019-09-17 09:50:30 -07:00
Balaji Varadarajan	7190c022bb	[HUDI-249] Updating Notice files	2019-09-13 13:50:58 -07:00
Balaji Varadarajan	d2525c31b7	Moving to 0.6.0-SNAPSHOT on master branch.	2019-09-13 09:58:29 -07:00
Mehrotra	0e6f078ec4	Fix logging in HoodieSparkSqlWriter	2019-09-07 07:51:11 -07:00
leesf	8b150a3c6b	[HUDI-230] Add missing Apache License in some files	2019-08-30 09:38:28 -07:00
Balaji Varadarajan	5f9fa82f47	HUDI-124 : Exclude jdk.tools from hadoop-common and update Notice files (#858 )	2019-08-28 16:20:47 -07:00
Balaji Varadarajan	c265b4948f	HUDI-128 Preparing POM for release and snapshot builds (#851 )	2019-08-26 08:52:36 -07:00
vinoth chandar	cd090871a1	[HUDI-159]: Pom cleanup and removal of com.twitter.parquet - Redo all classes based on org.parquet only - remove unuused dependencies like parquet-hadoop, common-configuration2 - timeline-service does not build a fat jar anymore - Fix utilities and hadoop-mr bundles based on above	2019-08-25 16:01:14 -07:00
vinoth chandar	6edf0b9def	[HUDI-68] Pom cleanup & demo automation (#846 ) - [HUDI-172] Cleanup Maven POM/Classpath - Fix ordering of dependencies in poms, to enable better resolution - Idea is to place more specific ones at the top - And place dependencies which use them below them - [HUDI-68] : Automate demo steps on docker setup - Move hive queries from hive cli to beeline - Standardize on taking query input from text command files - Deltastreamer ingest, also does hive sync in a single step - Spark Incremental Query materialized as a derived Hive table using datasource - Fix flakiness in HDFS spin up and output comparison - Code cleanup around streamlining and loc reduction - Also fixed pom to not shade some hive classs in spark, to enable hive sync	2019-08-22 20:18:50 -07:00
Balaji Varadarajan	a4f9d7575f	HUDI-123 Rename code packages/constants to org.apache.hudi (#830 ) - Rename com.uber.hoodie to org.apache.hudi - Flag to pass com.uber.hoodie Input formats for hoodie-sync - Works with HUDI demo. - Also tested for backwards compatibility with datasets built by com.uber.hoodie packages - Migration guide : https://cwiki.apache.org/confluence/display/HUDI/Migration+Guide+From+com.uber.hoodie+to+org.apache.hudi	2019-08-11 17:48:17 -07:00

40 Commits