Suneel Marthi
fa36082554
[HUDI-746] Reduce build warnings < 10 ( #1465 )
2020-03-30 11:46:52 +08:00
vinoth chandar
e057c27603
[HUDI-744] Restructure hudi-common and clean up files under util packages ( #1462 )
...
- Brings more order and cohesion to the classes in hudi-common
- Utils classes related to a particular concept (avro, timeline,...) are placed near the corresponding package
- common.fs package now contains all the filesystem level classes including wrapper filesystem
- bloom.filter package renamed to just bloom
- config package contains classes that help store properties
- common.fs.inline package contains all the inline filesystem classes/impl
- common.table.timeline now consolidates all timeline related classes
- common.table.view consolidates all the classes related to filesystem view metadata
- common.table.timeline.versioning contains all classes related to versioning of timeline
- Fix a few unit tests as a result
- Moved the test packages around to match the source file move
- Rename AvroUtils to TimelineMetadataUtils & minor fixes/typos
2020-03-29 10:58:49 -07:00
leesf
07c3c5d797
[HUDI-679] Make io package Spark free ( #1460 )
2020-03-29 16:54:00 +08:00
Sivabalan Narayanan
ac73bdcdc3
[HUDI-430] Adding InlineFileSystem to support embedding any file format as an InlineFile ( #1176 )
...
* Adding InlineFileSystem to support embedding any file format (parquet, hfile, etc). Supports reading the embedded file using respective readers.
2020-03-28 12:13:35 -04:00
Suneel Marthi
04449f33fe
[HUDI-743]: Remove FileIOUtils.close() ( #1461 )
2020-03-28 18:03:15 +08:00
Suneel Marthi
8c3001363d
HUDI-479: Eliminate or Minimize use of Guava if possible ( #1159 )
2020-03-28 03:11:32 -04:00
Raymond Xu
1713f686f8
[MINOR] Add error message when checking arguments ( #1451 )
2020-03-27 10:21:38 +08:00
leesf
8b0a4009a9
[HUDI-678] Make config package spark free ( #1418 )
2020-03-26 08:30:27 -07:00
Mathieu
5eed6c98a8
[MINOR] Fix javadoc of InsertBucket ( #1445 )
2020-03-25 22:25:47 +08:00
hongdd
cafc87041b
[HUDI-697] Add unit test for ArchivedCommitsCommand ( #1424 )
2020-03-23 13:46:10 +08:00
Zhiyuan Zhao
0241b21f77
[HUDI-65] Rename commitTime to instantTime ( #1431 )
2020-03-22 18:06:00 -07:00
Zhiyuan Zhao
06652aa935
[MINOR] Add missing param descriptions to method docs and clean up redundant code ( #1437 )
2020-03-22 21:39:33 +08:00
satishkotha
83fb9651f3
[HUDI-650] Modify handleUpdate path to validate partitionPath ( #1368 )
2020-03-20 08:37:22 -07:00
Sivabalan Narayanan
a752b7b18c
Merge pull request #1165 from yihua/HUDI-76-deltastreamer-csv-source
...
[HUDI-76] Add CSV Source support for Hudi Delta Streamer
2020-03-19 10:00:53 -04:00
ForwardXu
1e321c2fc0
[HUDI-209] Implement JMX metrics reporter ( #1106 )
2020-03-19 20:10:35 +08:00
leesf
0a4902ecce
[HUDI-437] Support user-defined index ( #1408 )
...
* [hotfix] set default value for index class config
* class config takes precedence over `hoodie.index.type`
2020-03-17 19:27:40 -07:00
Y Ethan Guo
cf765df606
[HUDI-76] Add CSV Source support for Hudi Delta Streamer
2020-03-15 19:03:37 -07:00
Suneel Marthi
99b7e9eb9e
[HUDI-629]: Replace Guava's Hashing with an equivalent in NumericUtils.java ( #1350 )
2020-03-13 20:28:05 -04:00
vinoth chandar
fb7fba365f
[HUDI-646] Fix failing test due to improper filesystem cleanup ( #1373 )
2020-03-12 23:59:09 -07:00
Prashant Wason
7d66831444
[MINOR] Removing code which is duplicated from the base class HoodieWriteHandle. ( #1399 )
2020-03-11 16:43:04 -07:00
Sivabalan Narayanan
1ca912af09
[HUDI-667] Fixing delete tests for DeltaStreamer ( #1395 )
2020-03-11 16:19:23 -07:00
Prashant Wason
77d5b92d88
[HUDI-668] Added additional unit-tests for HUDI metrics. ( #1380 )
2020-03-09 23:15:42 -04:00
hongdd
f93e64fee4
[HUDI-681] Remove embeddedTimelineService from HoodieReadClient ( #1388 )
2020-03-09 18:31:04 +08:00
Prashant Wason
5f8bf97005
[HUDI-671] Added unit-test for HBaseIndex ( #1381 )
2020-03-07 16:48:43 -08:00
vinoyang
ee5b32f5d4
[HUDI-652] Decouple HoodieReadClient and AbstractHoodieClient to break the inheritance chain ( #1372 )
...
* Removed timeline server support
* Removed try-with-resource
2020-03-06 09:59:35 -08:00
hongdd
8306205d7a
[HUDI-332] Add operation type (insert/upsert/bulkinsert/delete) to HoodieCommitMetadata ( #1157 )
2020-03-03 10:10:29 -08:00
yanghua
0dc8e493aa
Moving to 0.6.0-SNAPSHOT on master branch.
2020-03-01 15:08:30 +08:00
vinoth chandar
71170fafe7
[HUDI-554] Cleanup package structure in hudi-client ( #1346 )
...
- Just package, class moves and renames with the following intent
- `client` now has all the various client classes that do the transaction management
- `func` renamed to `execution` and some helpers moved to `client/utils`
- All compaction code under `io` now under `table/compact`
- Rollback code under `table/rollback` and in general all code for individual operations under `table`
- `exception`, `config`, `metrics` left untouched
- Moved the tests also accordingly
- Fixed some flaky tests
2020-02-27 08:05:58 -08:00
Suneel Marthi
078d4825d9
[HUDI-624]: Split some of the code from PR for HUDI-479 ( #1344 )
2020-02-21 14:22:21 +08:00
Suneel Marthi
f9d2f66dc1
[HUDI-622]: Remove VisibleForTesting annotation and import from code ( #1343 )
...
* HUDI:622: Remove VisibleForTesting annotation and import from code
2020-02-20 15:17:53 +08:00
Sivabalan Narayanan
00493235f5
[HUDI-108] Removing 2GB spark partition limitations in HoodieBloomIndex with spark 2.4.4 ( #1315 )
2020-02-18 11:12:20 -08:00
Suneel Marthi
24e73816b2
[MINOR] Code Cleanup, remove redundant code ( #1337 )
2020-02-15 22:03:29 +08:00
wangxianghu
aaa6cf9a98
[MINOR] Fix some typos
2020-02-15 09:49:25 +08:00
Satish Kotha
462fd02556
[HUDI-571] Add 'commits show archived' command to CLI
2020-02-05 11:25:34 -08:00
Raymond Xu
c1516df8ac
[HUDI-499] Allow update partition path with GLOBAL_BLOOM ( #1187 )
...
* Handle partition path update by deleting a record from the old partition and
inserting it into the new one
* Add a new configuration "hoodie.bloom.index.update.partition.path" to
enable the behavior
* Add a new unit test case for global bloom index
2020-02-05 09:33:33 -08:00
Suneel Marthi
594da28fbf
[HUDI-595] code cleanup, refactoring code out of PR# 1159 ( #1302 )
2020-02-04 21:52:03 +08:00
Satish Kotha
d07ac588ac
Increase test coverage for HoodieReadClient
2020-02-03 18:11:57 -08:00
Suneel Marthi
5b7bb142dc
[HUDI-583] Code Cleanup, remove redundant code, and other changes ( #1237 )
2020-02-02 18:03:44 +08:00
lamber-ken
c06ec8bfc7
[MINOR] Fix assigning to configuration multiple times ( #1291 )
2020-01-29 17:18:35 -05:00
leesf
6e59c1c777
Moving to 0.5.2-SNAPSHOT on master branch.
2020-01-20 10:51:33 -08:00
Balaji Varadarajan
ba54a7e973
[HUDI-559] : Make the timeline layout version default to be null version
2020-01-20 00:02:55 -08:00
wenningd
292c1e2ff4
[HUDI-238] Make Hudi support Scala 2.12 ( #1226 )
...
* [HUDI-238] Rename scala related artifactId & add maven profile to support Scala 2.12
2020-01-17 14:02:21 -08:00
Balaji Varadarajan
923e2b4a1e
[HUDI-535] Ensure Compaction Plan is always written in .aux folder to avoid 0.5.0/0.5.1 reader-writer compatibility issues ( #1229 )
2020-01-17 10:56:35 -08:00
vinoth chandar
c2c0f6b13d
[HUDI-509] Renaming code in sync with cWiki restructuring ( #1212 )
...
- Storage Type replaced with Table Type (remaining instances)
- View types replaced with query types
- ReadOptimized view referred to as Snapshot Query
- TableFileSystemView sub interfaces renamed to BaseFileOnly and Slice Views
- HoodieDataFile renamed to HoodieBaseFile
- Hive Sync tool will register RO tables for MOR with a `_ro` suffix
- Datasource/Deltastreamer options renamed accordingly
- Support fallback to old config values as well, so migration is painless
- Config for controlling _ro suffix addition
- Renaming DataFile to BaseFile across DTOs, HoodieFileSlice and AbstractTableFileSystemView
2020-01-16 23:58:47 -08:00
Y Ethan Guo
b39458b008
[MINOR] Make constant fields final in HoodieTestDataGenerator ( #1234 )
2020-01-16 12:42:30 +08:00
Balajee Nagasubramaniam
dd09abb56d
[HUDI-335] Improvements to DiskBasedMap used by ExternalSpillableMap, for write and random/sequential read paths, by introducing bufferedRandomAccessFile
2020-01-15 16:45:45 -08:00
lamber-ken
7aa3ce31e6
[MINOR] Fix redundant judgment statement ( #1231 )
2020-01-15 16:30:14 -08:00
Mehrotra
2bb0c21a3d
Fix conversion of Spark struct type to Avro schema
...
cr https://code.amazon.com/reviews/CR-17184364
2020-01-14 00:27:56 -08:00
lamber-ken
fd8f1c70c0
[MINOR] Reuse random object ( #1222 )
2020-01-13 18:26:04 -08:00
liujianhui
c1f8acab34
[HUDI-526] fix the HoodieAppendHandle
2020-01-13 10:44:31 -08:00