lanyuanxiaoyao/hudi - hudi

Author	SHA1	Message	Date
Balaji Varadarajan	3a210ef08e	Disable Notice Plugin	2019-06-18 11:33:26 -07:00
Nishith Agarwal	129e433641	- Upgrading to Hive 2.x - Eliminating in-memory deltaRecordsMap - Use writerSchema to generate generic record needed by custom payloads - changes to make tests work with hive 2.x	2019-06-13 12:46:14 -07:00
Balaji Varadarajan	479908fd20	HUDI-125 : Change License for all source files and update RAT configurations	2019-06-09 11:41:55 -07:00
Balaji Varadarajan	30b0f2636f	Changes related to Licensing work 1. Go through dependencies list one round to ensure compliance. Generated current NOTICE list in all submodules (other apache projects like flink does this). To be on conservative side regarding licensing, NOTICE.txt lists all dependencies including transitive. Pending Compliance questions reported in https://issues.apache.org/jira/browse/LEGAL-461 2. Automate generating NOTICE.txt files to allow future package compliance issues be identified early as part of code-review process. 3. Added NOTICE.txt and LICENSE.txt to all HUDI jars	2019-06-07 17:58:57 -07:00
guanjianhui	b325cbff10	set codehaus.jackson modules to the same version 1.9.13	2019-06-07 11:33:43 -07:00
Vinoth Chandar	acd74129cd	Create hoodie-utilities-bundle to host the shaded jar - hoodie-utilities can now be pulled in as compile time dependency - Lets users test their DeltaStreamer transformers for e.g - Tested the docker demo works & takes in the bundle - Doc changes to follow, to move DeltaStreamer commands to bundle jar	2019-05-30 22:46:24 -07:00
Vinoth Chandar	3b916ec1af	Add support for maven deploy plugin to make snapshot releases	2019-05-30 21:35:12 -07:00
guanjianhui	6b5abb5d92	fix maven pom	2019-05-29 16:16:29 -07:00
vinothchandar	66c0b81b49	[maven-release-plugin] prepare for next development iteration	2019-05-28 19:17:26 -07:00
vinothchandar	227785c022	[maven-release-plugin] prepare release hoodie-0.4.7	2019-05-28 19:17:15 -07:00
Vinoth Chandar	e43efa042f	Downgrading fasterxml jackson to 2.6.7 to be spark compatible	2019-05-16 13:53:54 -07:00
Balaji Varadarajan	64fec64097	Timeline Service with Incremental View Syncing support	2019-05-16 13:25:33 -07:00
vinothchandar	446f99aa0f	[maven-release-plugin] prepare for next development iteration	2019-05-14 07:29:22 -07:00
vinothchandar	cc38abecc8	[maven-release-plugin] prepare release hoodie-0.4.6	2019-05-14 07:29:11 -07:00
Balaji Varadarajan	ee1feb7c75	Revert "HUDI-101: added mevn-shade plugin with filters." Creates fat jars for all hoodie packages This reverts commit `f47f0eb6cb`.	2019-05-05 18:39:38 -07:00
Abhishek Sharma	f47f0eb6cb	HUDI-101: added mevn-shade plugin with filters.	2019-05-03 13:49:51 -07:00
Balaji Varadarajan	adc8cac743	Fix hive sync (libfb version mismatch) and deltastreamer issue (missing cmdline argument) in demo	2019-03-13 16:14:32 -07:00
Vinoth Chandar	363df2c12e	Upgrade various jar, gem versions for maintenance	2019-03-01 10:14:00 -08:00
vinothchandar	687395e40f	[maven-release-plugin] prepare for next development iteration	2019-02-27 07:16:27 -08:00
vinothchandar	bbf40ef987	[maven-release-plugin] prepare release hoodie-0.4.5	2019-02-27 07:16:15 -08:00
Bhavani Sudha Saktheeswaran	75c7a2622b	Create hoodie-presto bundle jar Exclude common dependencies that are available in Presto	2019-02-24 19:49:02 -08:00
Balaji Varadarajan	3a0044216c	New Features in DeltaStreamer : (1) Apply transformation when using delta-streamer to ingest data. (2) Add Hudi Incremental Source for Delta Streamer (3) Allow delta-streamer config-property to be passed as command-line (4) Add Hive Integration to Delta-Streamer and address Review comments (5) Ensure MultiPartKeysValueExtractor handle hive style partition description (6) Reuse same spark session on both source and transformer (7) Support extracting partition fields from _hoodie_partition_path for HoodieIncrSource (8) Reuse Binary Avro coders (9) Add push down filter for Incremental source (10) Add Hoodie DeltaStreamer metrics to track total time taken	2019-02-11 18:22:05 -08:00
arukavytsia	6946dd7557	General enhancements	2018-12-18 12:52:39 -08:00
Vinoth Chandar	0015c9b00e	Update committership for balaji	2018-11-30 16:23:10 -08:00
vinoth chandar	0a200c32e5	Reflect new committership, id changes for devs	2018-10-02 11:00:50 +05:30
Balaji Varadarajan	f3418e4718	Docker Container Build and Run setup with foundations for adding docker integration tests. Docker images built with Hadoop 2.8.4 Hive 2.3.3 and Spark 2.3.1 and published to docker-hub Look at quickstart document for how to setup docker and run demo	2018-10-02 09:28:21 +05:30
vinothchandar	b5a75fdd91	Adding Jiale & Anbu to contributors list	2018-09-29 20:20:28 +05:30
vinothchandar	7ba842c0fe	[maven-release-plugin] prepare for next development iteration	2018-09-28 11:27:00 +05:30
vinothchandar	5847b61f44	[maven-release-plugin] prepare release hoodie-0.4.4	2018-09-28 11:26:15 +05:30
vinothchandar	9ca6f91e97	Perform consistency checks during write finalize - Check to ensure written files are listable on storage - Docs reflected to capture how this helps with s3 storage - Unit tests added, corrections to existing tests - Fix DeltaStreamer to manage archived commits in a separate folder	2018-09-28 08:04:41 +05:30
Balaji Varadarajan	4c74dd4cad	Travis CI tests needs to be run in quieter mode (WARN log level) to avoid max log-size errors	2018-09-26 21:10:20 +05:30
Yishuang Lu	faf93b6340	Fix the name of avro schema file in Test Fixed the name of avro schema file in Test Signed-off-by: Yishuang Lu <luystu@gmail.com>	2018-09-24 21:58:34 +05:30
Vinoth Chandar	bd5af89f12	[maven-release-plugin] rollback the release of hoodie-0.4.4	2018-09-13 15:01:53 +05:30
Vinoth Chandar	d1cc864a43	[maven-release-plugin] prepare for next development iteration	2018-09-12 23:59:47 +05:30
Vinoth Chandar	b748bc836d	[maven-release-plugin] prepare release hoodie-0.4.4	2018-09-12 23:59:34 +05:30
Vinoth Chandar	a5359662be	Moving dependencies off cdh to apache + Hive2 support - Tests redone in the process - Main changes are to RealtimeRecordReader and how it treats maps/arrays - Make hive sync work with Hive 1/2 and CDH environments - Fixes to make corner cases for Hive queries - Spark Hive integration - Working version across Apache and CDH versions - Known Issue - https://github.com/uber/hudi/issues/439	2018-09-11 11:03:30 +05:30
Vinoth Chandar	d58ddbd999	Reworking the deltastreamer tool - Standardize version of jackson - DFSPropertiesConfiguration replaces usage of commons PropertiesConfiguration - Remove dependency on ConstructorUtils - Throw error if ordering value is not present, during key generation - Switch to shade plugin for hoodie-utilities - Added support for consumption for Confluent avro kafka serdes - Support for Confluent schema registry - KafkaSource now deals with skews nicely, by doing round robin allocation of source limit across partitions - Added support for BULK_INSERT operations as well - Pass in the payload class config properly into HoodieWriteClient - Fix documentation based on new usage - Adding tests on deltastreamer, sources and all new util classes.	2018-09-08 10:24:32 +08:00
Nishith Agarwal	324de298bc	Removing dependency on apache-commons lang 3, adding necessary classes as needed	2018-09-06 08:26:48 +08:00
Saravanan Elumalai	2eaa42abde	Updated jcommander version to fix NPE in HoodieDeltaStreamer tool	2018-08-31 07:28:13 -07:00
Vinoth Chandar	89cd6b0726	[maven-release-plugin] prepare for next development iteration	2018-08-22 21:30:05 -07:00
Vinoth Chandar	8d305c5a86	[maven-release-plugin] prepare release hoodie-0.4.3	2018-08-22 21:29:53 -07:00
Vinoth Chandar	34827d50e1	[maven-release-plugin] prepare for next development iteration	2018-06-11 08:59:13 -07:00
Vinoth Chandar	43ef385730	[maven-release-plugin] prepare release hoodie-0.4.2	2018-06-11 08:59:02 -07:00
Balaji Varadarajan	788e4f2d2e	CodeStyle formatting to conform to basic Checkstyle rules. The code-style rules follow google style with some changes: 1. Increase line length from 100 to 120 2. Disable JavaDoc related checkstyles as this needs more manual work. Both source and test code are checked for code-style	2018-03-30 11:09:40 -07:00
Vinoth Chandar	73534d467f	[maven-release-plugin] prepare for next development iteration	2018-03-07 21:04:10 -08:00
Vinoth Chandar	f2e5c6f9f8	[maven-release-plugin] prepare release hoodie-0.4.1	2018-03-07 21:04:00 -08:00
Vinoth Chandar	e45679f5e2	Reformatting code per Google Code Style all over	2017-11-12 23:19:02 -08:00
Vinoth Chandar	e1fe3ab937	[maven-release-plugin] prepare for next development iteration	2017-10-02 22:42:54 -07:00
Vinoth Chandar	50139fe904	[maven-release-plugin] prepare release hoodie-0.4.0	2017-10-02 22:42:32 -07:00
Vinoth Chandar	64e0573aca	Adding hoodie-spark to support Spark Datasource for Hoodie - Write with COW/MOR paths work fully - Read with RO view works on both storages* - Incremental view supported on COW - Refactored out HoodieReadClient methods, to just contain key based access - HoodieDataSourceHelpers class can be now used to construct inputs to datasource - Tests in hoodie-client using new helpers and mechanisms - Basic tests around save modes & insert/upserts (more to follow) - Bumped up scala to 2.11, since 2.10 is deprecated & complains with scalatest - Updated documentation to describe usage - New sample app written using the DataSource API	2017-10-02 20:44:53 -07:00

111 Commits