Satish Kotha
462fd02556
[HUDI-571] Add 'commits show archived' command to CLI
2020-02-05 11:25:34 -08:00
Raymond Xu
c1516df8ac
[HUDI-499] Allow update partition path with GLOBAL_BLOOM (#1187)
* Handle partition path update by deleting the record from the old partition and
inserting it into the new one
* Add a new configuration "hoodie.bloom.index.update.partition.path" to
enable the behavior
* Add a new unit test case for global bloom index
2020-02-05 09:33:33 -08:00
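Illustration for HUDI-499 above: the delete-then-insert handling of a changed partition path can be sketched roughly as follows. This is a minimal Python sketch, not Hudi code. Only the configuration key comes from the commit message; `plan_writes`, the `index` dict, the tuple layout, and the fallback when the option is off are assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Config key named in the commit message; everything else here is hypothetical.
UPDATE_PARTITION_PATH_KEY = "hoodie.bloom.index.update.partition.path"


def plan_writes(
    index: Dict[str, str],
    incoming: List[Tuple[str, str]],
    config: Dict[str, str],
) -> List[Tuple[str, str, str]]:
    """Return (operation, record_key, partition_path) tuples.

    index maps a record key to the partition where it currently lives.
    incoming is a list of (record_key, new_partition_path) pairs.
    """
    update_partition = config.get(UPDATE_PARTITION_PATH_KEY, "false") == "true"
    plan: List[Tuple[str, str, str]] = []
    for key, new_partition in incoming:
        old_partition = index.get(key)
        if old_partition is None:
            # Key not seen before: plain insert.
            plan.append(("insert", key, new_partition))
        elif old_partition == new_partition:
            # Same partition: ordinary update.
            plan.append(("update", key, old_partition))
        elif update_partition:
            # Partition changed and the option is on: delete old, insert new.
            plan.append(("delete", key, old_partition))
            plan.append(("insert", key, new_partition))
        else:
            # Assumed fallback: keep the record in its original partition.
            plan.append(("update", key, old_partition))
    return plan
```

With the option set to "true", a key that moves from one partition to another yields a delete in the old partition followed by an insert in the new one; with it unset, the sketch keeps the record where it was.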
Suneel Marthi
594da28fbf
[HUDI-595] Code cleanup, refactoring code out of PR #1159 (#1302)
2020-02-04 21:52:03 +08:00
Satish Kotha
d07ac588ac
Increase test coverage for HoodieReadClient
2020-02-03 18:11:57 -08:00
Suneel Marthi
5b7bb142dc
[HUDI-583] Code Cleanup, remove redundant code, and other changes (#1237)
2020-02-02 18:03:44 +08:00
lamber-ken
c06ec8bfc7
[MINOR] Fix assigning to configuration multiple times (#1291)
2020-01-29 17:18:35 -05:00
leesf
6e59c1c777
Moving to 0.5.2-SNAPSHOT on master branch.
2020-01-20 10:51:33 -08:00
Balaji Varadarajan
ba54a7e973
[HUDI-559] Make the timeline layout version default to the null version
2020-01-20 00:02:55 -08:00
wenningd
292c1e2ff4
[HUDI-238] Make Hudi support Scala 2.12 (#1226)
* [HUDI-238] Rename Scala-related artifactId & add Maven profile to support Scala 2.12
2020-01-17 14:02:21 -08:00
Balaji Varadarajan
923e2b4a1e
[HUDI-535] Ensure Compaction Plan is always written in .aux folder to avoid 0.5.0/0.5.1 reader-writer compatibility issues (#1229)
2020-01-17 10:56:35 -08:00
vinoth chandar
c2c0f6b13d
[HUDI-509] Renaming code in sync with cWiki restructuring (#1212)
- Storage Type replaced with Table Type (remaining instances)
- View types replaced with query types
- ReadOptimized view referred to as Snapshot Query
- TableFileSystemView sub interfaces renamed to BaseFileOnly and Slice Views
- HoodieDataFile renamed to HoodieBaseFile
- Hive Sync tool will register RO tables for MOR with a `_ro` suffix
- Datasource/Deltastreamer options renamed accordingly
- Support fallback to old config values as well, so migration is painless
- Config for controlling _ro suffix addition
- Renaming DataFile to BaseFile across DTOs, HoodieFileSlice and AbstractTableFileSystemView
2020-01-16 23:58:47 -08:00
Y Ethan Guo
b39458b008
[MINOR] Make constant fields final in HoodieTestDataGenerator (#1234)
2020-01-16 12:42:30 +08:00
Balajee Nagasubramaniam
dd09abb56d
[HUDI-335] Improvements to DiskBasedMap used by ExternalSpillableMap, for write and random/sequential read paths, by introducing BufferedRandomAccessFile
2020-01-15 16:45:45 -08:00
lamber-ken
7aa3ce31e6
[MINOR] Fix redundant judgment statement (#1231)
2020-01-15 16:30:14 -08:00
Mehrotra
2bb0c21a3d
Fix conversion of Spark struct type to Avro schema
cr https://code.amazon.com/reviews/CR-17184364
2020-01-14 00:27:56 -08:00
lamber-ken
fd8f1c70c0
[MINOR] Reuse random object (#1222)
2020-01-13 18:26:04 -08:00
liujianhui
c1f8acab34
[HUDI-526] fix the HoodieAppendHandle
2020-01-13 10:44:31 -08:00
lamber-ken
d9675c4ec0
[HUDI-522] Use the same jcommander version uniformly (#1214)
2020-01-12 10:48:52 -08:00
lamber-ken
017ee8e661
[MINOR] Fix partition typo (#1209)
2020-01-12 09:35:55 +08:00
pratyakshsharma
3c90d252cc
[HUDI-114]: added option to overwrite payload implementation in hoodie.properties file
2020-01-09 22:34:40 -08:00
hongdd
5af3dc6aed
[HUDI-331] Fix Javadocs for all public APIs in HoodieWriteClient (#1111)
2020-01-09 16:00:53 +08:00
Wenning Ding
aba83876e7
Update deprecated HBase API
2020-01-08 10:26:47 -08:00
vinoth chandar
9706f659db
[HUDI-508] Standardizing on "Table" instead of "Dataset" across code (#1197)
- Docs were talking about storage types before, cWiki moved to "Table"
- Most of code already has HoodieTable, HoodieTableMetaClient - correct naming
- Replacing remaining uses of dataset across code/comments
- Few usages in comments and use of Spark SQL DataSet remain unscathed
2020-01-07 12:52:32 -08:00
Balaji Varadarajan
8306f749a2
[HUDI-417] Refactor HoodieWriteClient so that commit logic can be shareable by both bootstrap and normal write operations (#1166)
2020-01-06 20:11:48 -08:00
Abhishek Modi
b5df6723a2
[HUDI-464] Use Hive Exec Core for tests (#1125)
2020-01-06 16:32:55 -08:00
hejinbiao123
b9fab0b933
Revert "[HUDI-455] Redo hudi-client log statements using SLF4J (#1145)" (#1181)
This reverts commit e637d9ed26.
2020-01-06 21:13:29 +08:00
Sivabalan Narayanan
7031445eb3
[HUDI-377] Adding Delete() support to DeltaStreamer (#1073)
- Provides the ability to perform hard deletes by writing delete marker records into the source data
- If a record contains the special field _hoodie_delete_marker set to true, the delete is performed
2020-01-04 11:07:31 -08:00
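Illustration for HUDI-377 above: the delete-marker convention can be sketched as a simple split of the incoming batch. This is a hedged Python sketch, not the DeltaStreamer implementation. The field name `_hoodie_delete_marker` is taken from the commit message; the function name `split_upserts_and_deletes` and the dict-based record shape are assumptions.

```python
from typing import Any, Dict, Iterable, List, Tuple

# Field name taken from the commit message.
DELETE_MARKER_FIELD = "_hoodie_delete_marker"

Record = Dict[str, Any]


def split_upserts_and_deletes(
    records: Iterable[Record],
) -> Tuple[List[Record], List[Record]]:
    """Split a batch into records to upsert and records to hard-delete.

    A record counts as a delete only when the marker field is present
    and set to True; a missing or False marker means upsert.
    """
    upserts: List[Record] = []
    deletes: List[Record] = []
    for record in records:
        if record.get(DELETE_MARKER_FIELD) is True:
            deletes.append(record)
        else:
            upserts.append(record)
    return upserts, deletes
```

Treating a missing marker as "not a delete" keeps source data that predates the field working unchanged, which is an assumption of this sketch rather than something the commit message states.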
SteNicholas
726ae47ce2
[MINOR] Optimize hudi-client module (#1139)
2020-01-04 10:57:08 -08:00
Pratyaksh Sharma
dde21e7315
[HUDI-402]: code clean up in test cases
2019-12-31 11:10:49 -08:00
hejinbiao123
e637d9ed26
[HUDI-455] Redo hudi-client log statements using SLF4J (#1145)
2019-12-31 13:49:34 +08:00
lamber-ken
ab6ae5cebb
[HUDI-482] Fix missing @Override annotation on methods (#1156)
2019-12-31 11:44:56 +08:00
dengziming
2a823f32ee
[MINOR] Fix some wrong params that cause a fatal exception
2019-12-30 16:50:12 -08:00
lamber-ken
8440482977
Fix empty content clean plan
2019-12-29 19:03:56 -08:00
lamber-ken
2f254163d4
Skip setting commit metadata
2019-12-29 19:03:56 -08:00
lamber-ken
179837e8ef
Fix checkstyle
2019-12-29 19:03:56 -08:00
lamber-ken
58c5bed40a
[HUDI-453] Fix "failed to archive commits" error thrown when writing data to MOR/COW table
2019-12-29 19:03:56 -08:00
Mathieu
3c811ec29b
[MINOR] fix typos
2019-12-25 20:26:16 +08:00
Sivabalan Narayanan
9c4217a3e1
[HUDI-389] Fixing Index look up to return right partitions for a given key along with fileId with Global Bloom (#1091)
* Fixing Index look up to return partitions for a given key along with fileId with Global Bloom
* Addressing some of the comments
* Fixing test in TestHoodieGlobalBloomIndex to test the fix
2019-12-24 20:56:30 -08:00
dengziming
94aec965f5
[MINOR] Fix a few typos in the Javadocs (#1132)
2019-12-24 20:44:11 -08:00
Mathieu
41f36770e0
[MINOR] fix typo
2019-12-25 06:48:15 +08:00
Thinking Chen
8172197c35
Fix Error: java.lang.IllegalArgumentException: Can not create a Path from an empty string in HoodieCopyOnWrite#deleteFilesFunc (#1126)
Same issue as in https://github.com/apache/incubator-hudi/pull/771;
this time it is in the HoodieCopyOnWrite deleteFilesFunc method
2019-12-24 14:29:28 +08:00
Sivabalan Narayanan
14881e99e0
[HUDI-106] Adding support for DynamicBloomFilter (#976)
- Introduced configs for bloom filter type
- Implemented dynamic bloom filter with configurable max number of keys
- BloomFilterFactory abstractions; Defaults to current simple bloom filter
2019-12-17 19:06:24 -08:00
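Illustration for HUDI-106 above: a dynamic bloom filter with a configurable maximum number of keys can be sketched as a list of fixed-size filters that grows until a cap is reached. This Python sketch shows the general technique only and is not Hudi's implementation. The class names, the SHA-256 based hashing, the default sizes, and the rule that the cap is `max_keys // keys_per_filter` filters are all assumptions.

```python
import hashlib
from typing import Iterator, List


class SimpleBloomFilter:
    """Fixed-size bloom filter backed by a bytearray."""

    def __init__(self, num_bits: int, num_hashes: int) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key: str) -> Iterator[int]:
        # Derive num_hashes bit positions by salting SHA-256 with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key: str) -> None:
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key: str) -> bool:
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(key)
        )


class DynamicBloomFilter:
    """Grows by adding filters until the assumed key cap is reached."""

    def __init__(self, keys_per_filter: int, max_keys: int,
                 num_bits: int = 1024, num_hashes: int = 3) -> None:
        self.keys_per_filter = keys_per_filter
        # Assumed rule: stop growing once max_keys worth of filters exist.
        self.max_filters = max(1, max_keys // keys_per_filter)
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.filters: List[SimpleBloomFilter] = [
            SimpleBloomFilter(num_bits, num_hashes)
        ]
        self.keys_in_current = 0

    def add(self, key: str) -> None:
        full = self.keys_in_current >= self.keys_per_filter
        if full and len(self.filters) < self.max_filters:
            # Current filter is full and growth is still allowed.
            self.filters.append(SimpleBloomFilter(self.num_bits, self.num_hashes))
            self.keys_in_current = 0
        # Past the cap, keys keep going into the last filter.
        self.filters[-1].add(key)
        self.keys_in_current += 1

    def might_contain(self, key: str) -> bool:
        return any(f.might_contain(key) for f in self.filters)
```

Once the cap is reached the last filter keeps absorbing keys, so the false-positive rate rises instead of the structure growing without bound; there are never false negatives.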
Balaji Varadarajan
9a1f698eef
[HUDI-308] Avoid Renames for tracking state transitions of all actions on dataset
2019-12-15 21:26:30 -08:00
lamber-ken
ba514cfea0
[MINOR] Remove redundant plus operator (#1097)
2019-12-12 05:42:05 +08:00
Pratyaksh Sharma
3790b75e05
[HUDI-368] Code clean up in TestAsyncCompaction class (#1050)
2019-12-11 05:52:41 +08:00
lamber-ken
d447e2d751
[checkstyle] Unify LOG form (#1092)
2019-12-10 19:23:38 +08:00
Wenning Ding
e555aa516d
[HUDI-353] Add hive style partitioning path
2019-12-09 12:29:53 -08:00
lamber-ken
2745b7552f
[HUDI-379] Refactor the code based on the new JavadocStyle code style rule (#1079)
2019-12-06 12:59:28 +08:00
lamber-ken
c06d89b648
[HUDI-378] Refactor the rest of the code based on the new ImportOrder code style rule (#1078)
2019-12-05 17:25:03 +08:00
lamber-ken
b3e0ebbc4a
[checkstyle] Add ConstantName java checkstyle rule (#1066)
* add SimplifyBooleanExpression java checkstyle rule
* collapse empty tags in scalastyle file
2019-12-04 18:59:15 +08:00