lanyuanxiaoyao/hudi - hudi - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Raymond Xu	14323cb100	[HUDI-344] Improve exporter tests (#1404 )	2020-03-15 20:24:30 +08:00
Suneel Marthi	99b7e9eb9e	[HUDI-629]: Replace Guava's Hashing with an equivalent in NumericUtils.java (#1350 ) * [HUDI-629]: Replace Guava's Hashing with an equivalent in NumericUtils.java	2020-03-13 20:28:05 -04:00
Sivabalan Narayanan	1ca912af09	[HUDI-667] Fixing delete tests for DeltaStreamer (#1395 )	2020-03-11 16:19:23 -07:00
openopen2	44700d531a	[HUDI-344] Hudi Dataset Snapshot Exporter (#1360 ) Co-authored-by: jason1993 <261049174@qq.com>	2020-03-10 09:17:51 +08:00
hongdd	f93e64fee4	[HUDI-681]Remove embeddedTimelineService from HoodieReadClient (#1388 ) * [HUDI-681]Remove embeddedTimelineService from HoodieReadClient	2020-03-09 18:31:04 +08:00
lamber-ken	ccbf543607	[HUDI-654] Rename hudi-hive to hudi-hive-sync	2020-03-06 22:13:16 +08:00
yanghua	0dc8e493aa	Moving to 0.6.0-SNAPSHOT on master branch.	2020-03-01 15:08:30 +08:00
vinoth chandar	71170fafe7	[HUDI-554] Cleanup package structure in hudi-client (#1346 ) - Just package, class moves and renames with the following intent - `client` now has all the various client classes, that do the transaction management - `func` renamed to `execution` and some helpers moved to `client/utils` - All compaction code under `io` now under `table/compact` - Rollback code under `table/rollback` and in general all code for individual operations under `table` - `exception` `config`, `metrics` left untouched - Moved the tests also accordingly - Fixed some flaky tests	2020-02-27 08:05:58 -08:00
Suneel Marthi	078d4825d9	[HUDI-624]: Split some of the code from PR for HUDI-479 (#1344 )	2020-02-21 14:22:21 +08:00
Suneel Marthi	f9d2f66dc1	[HUDI-622]: Remove VisibleForTesting annotation and import from code (#1343 ) * HUDI:622: Remove VisibleForTesting annotation and import from code	2020-02-20 15:17:53 +08:00
amitsingh-10	c2b08cdfc9	[HUDI-617] Add support for types implementing CharSequence (#1339 ) - Data types extending CharSequence implement a #toString method which provides an easy way to convert them to String. - For example, org.apache.avro.util.Utf8 is easily convertible into String if we use the toString() method. It's better to make the support more generic to support a wider range of data types as partitionKey.	2020-02-18 11:19:44 -08:00
Mathieu	8c6138cb01	[MINOR] Add javadoc to SchedulerConfGenerator and code clean (#1340 )	2020-02-18 11:15:02 -08:00
wangxianghu	aaa6cf9a98	[MINOR] Fix some typos	2020-02-15 09:49:25 +08:00
openopen2	dfbee673ef	[HUDI-514] A schema provider to get metadata through Jdbc (#1200 )	2020-02-13 18:06:06 -08:00
Mathieu	175de0db7b	[MINOR] Fix typo (#1331 )	2020-02-13 10:46:10 -08:00
Mathieu	5fdf5a1927	[HUDI-560] Remove legacy IdentityTransformer (#1264 )	2020-02-10 10:04:58 +08:00
lamber-ken	46842f4e92	[MINOR] Remove the declaration of thrown RuntimeException (#1305 )	2020-02-05 23:23:20 +08:00
lamber-ken	425e3e6c78	[HUDI-585] Optimize the steps of building with scala-2.12 (#1293 )	2020-02-05 23:13:10 +08:00
Suneel Marthi	594da28fbf	[HUDI-595] code cleanup, refactoring code out of PR# 1159 (#1302 )	2020-02-04 21:52:03 +08:00
dengziming	347e297ac1	[HUDI-596] Close KafkaConsumer every time (#1303 )	2020-02-03 23:42:21 -08:00
Suneel Marthi	5b7bb142dc	[HUDI-583] Code Cleanup, remove redundant code, and other changes (#1237 )	2020-02-02 18:03:44 +08:00
leesf	ed54eb20a5	[MINOR] Add missing licenses (#1271 )	2020-01-22 08:06:45 -05:00
leesf	6e59c1c777	Moving to 0.5.2-SNAPSHOT on master branch.	2020-01-20 10:51:33 -08:00
Y Ethan Guo	9489d0fb84	[HUDI-551] Abstract a test case class for DFS Source to make it extensible (#1239 )	2020-01-19 18:50:12 +08:00
Y Ethan Guo	d0ee95ed16	[HUDI-552] Fix the schema mismatch in Row-to-Avro conversion (#1246 )	2020-01-18 16:40:56 -08:00
wenningd	292c1e2ff4	[HUDI-238] Make Hudi support Scala 2.12 (#1226 ) * [HUDI-238] Rename scala related artifactId & add maven profile to support Scala 2.12	2020-01-17 14:02:21 -08:00
vinoth chandar	c2c0f6b13d	[HUDI-509] Renaming code in sync with cWiki restructuring (#1212 ) - Storage Type replaced with Table Type (remaining instances) - View types replaced with query types; - ReadOptimized view referred as Snapshot Query - TableFileSystemView sub interfaces renamed to BaseFileOnly and Slice Views - HoodieDataFile renamed to HoodieBaseFile - Hive Sync tool will register RO tables for MOR with a `_ro` suffix - Datasource/Deltastreamer options renamed accordingly - Support fallback to old config values as well, so migration is painless - Config for controlling _ro suffix addition - Renaming DataFile to BaseFile across DTOs, HoodieFileSlice and AbstractTableFileSystemView	2020-01-16 23:58:47 -08:00
Y Ethan Guo	b39458b008	[MINOR] Make constant fields final in HoodieTestDataGenerator (#1234 )	2020-01-16 12:42:30 +08:00
Scheller	1daba24065	Add GlobalDeleteKeyGenerator Adds new GlobalDeleteKeyGenerator for record_key deletes with global indices. Also refactors key generators into their own package.	2020-01-15 17:01:29 -08:00
Mehrotra	2bb0c21a3d	Fix conversion of Spark struct type to Avro schema cr https://code.amazon.com/reviews/CR-17184364	2020-01-14 00:27:56 -08:00
lamber-ken	fd8f1c70c0	[MINOR] Reuse random object (#1222 )	2020-01-13 18:26:04 -08:00
openopen2	a44c61b813	[HUDI-502] provide a custom time zone definition for TimestampBasedKeyGenerator (#1188 )	2020-01-12 15:45:23 -08:00
harveyyue	971c7d41bd	[HUDI-322] DeltaSteamer should pick checkpoints off only deltacommits for MOR tables	2020-01-12 15:11:47 -08:00
Udit Mehrotra	ad50008a59	[HUDI-91][HUDI-12]Migrate to spark 2.4.4, migrate to spark-avro library instead of databricks-avro, add support for Decimal/Date types - Upgrade Spark to 2.4.4, Parquet to 1.10.1, Avro to 1.8.2 - Remove spark-avro from hudi-spark-bundle. Users need to provide --packages org.apache.spark:spark-avro:2.4.4 when running spark-shell or spark-submit - Replace com.databricks:spark-avro with org.apache.spark:spark-avro - Shade avro in hudi-hadoop-mr-bundle to make sure it does not conflict with hive's avro version.	2020-01-12 15:03:11 -08:00
lamber-ken	d9675c4ec0	[HUDI-522] Use the same version jcommander uniformly (#1214 )	2020-01-12 10:48:52 -08:00
pratyakshsharma	3c90d252cc	[HUDI-114]: added option to overwrite payload implementation in hoodie.properties file	2020-01-09 22:34:40 -08:00
vinoth chandar	9706f659db	[HUDI-508] Standardizing on "Table" instead of "Dataset" across code (#1197 ) - Docs were talking about storage types before, cWiki moved to "Table" - Most of code already has HoodieTable, HoodieTableMetaClient - correct naming - Replacing renaming use of dataset across code/comments - Few usages in comments and use of Spark SQL DataSet remain unscathed	2020-01-07 12:52:32 -08:00
Abhishek Modi	b5df6723a2	[HUDI-464] Use Hive Exec Core for tests (#1125 )	2020-01-06 16:32:55 -08:00
lamber-ken	75c3f630d4	[HUDI-405] Remove HIVE_ASSUME_DATE_PARTITION_OPT_KEY config from DataSource	2020-01-06 14:25:38 -08:00
Pratyaksh Sharma	8f935e779a	[HUDI-406]: added default partition path in TimestampBasedKeyGenerator	2020-01-06 09:38:06 -08:00
lamber-ken	28ccf8c521	[HUDI-484] Fix NPE when reading IncrementalPull.sqltemplate in HiveIncrementalPuller (#1167 )	2020-01-04 23:53:47 -08:00
Sivabalan Narayanan	7031445eb3	[HUDI-377] Adding Delete() support to DeltaStreamer (#1073 ) - Provides ability to perform hard deletes by writing delete marker records into the source data - if the record contains a special field _hoodie_delete_marker set to true, deletes are performed	2020-01-04 11:07:31 -08:00
Pratyaksh Sharma	290278fc6c	[HUDI-118]: Options provided for passing properties to Cleaner, compactor and importer commands	2020-01-03 16:00:57 -08:00
lamber-ken	e1e5fe3324	[MINOR] Fix error usage of String.format (#1169 )	2020-01-02 09:11:15 +08:00
Pratyaksh Sharma	dde21e7315	[HUDI-402]: code clean up in test cases	2019-12-31 11:10:49 -08:00
lamber-ken	ab6ae5cebb	[HUDI-482] Fix missing @Override annotation on methods (#1156 ) * [HUDI-482] Fix missing @Override annotation on methods	2019-12-31 11:44:56 +08:00
yungthuis66	f20a130e3a	[MINOR] typo fix (#1142 )	2019-12-26 09:03:43 -08:00
vinoth chandar	350b0ecb4d	[HUDI-311] : Support for AWS Database Migration Service in DeltaStreamer - Add a transformer class, that adds `Op` fiels if not found in input frame - Add a payload implementation, that issues deletes when Op=D - Remove Parquet as a top level source type, consolidate with RowSource - Made delta streamer work without a property file, simply using overridden cli options - Unit tests for transformer/payload classes	2019-12-23 20:56:55 -08:00
lamber-ken	ba514cfea0	[MINOR] Remove redundant plus operator (#1097 )	2019-12-12 05:42:05 +08:00
lamber-ken	d447e2d751	[checkstyle] Unify LOG form (#1092 )	2019-12-10 19:23:38 +08:00

1 2

90 Commits