1
0
Commit Graph

36 Commits

Author SHA1 Message Date
Wei Yan
c4fa585b27 Switch some info log to debug (#83)
* Switch some info log to debug

* fix a typo

* remote HoodieTableMetadata file
2017-02-23 20:12:36 -08:00
Prasanna Rajaperumal
fe5c5e8021 Test Failure in Travis-ci 2017-02-21 20:25:01 -08:00
Prasanna Rajaperumal
1132f3533d Merge and pull master commits 2017-02-21 17:53:28 -08:00
prazanna
eb46e7c72b Implement Merge on Read Storage (#76)
1. Create HoodieTable abstraction for commits and fileSystemView
2. HoodieMergeOnReadTable created
3. View is now always obtained from the table and the correct view based on the table type is returned
2017-02-21 16:24:38 -08:00
Prasanna Rajaperumal
ccd8cb2407 Take 2: Refactor hoodie-common and create right abstractions for Hoodie Storage V2.0
- Refactored timelines to be a single timeline for all active events and one for archived events. CommitTimeline and other timelines can be inferred by applying a filter on the activeTimelime
- Introduced HoodieInstant to abstract different types of action, commit time and if isInFlight
- Implemented other review comments
2017-02-21 16:23:53 -08:00
Prasanna Rajaperumal
8ee777a9bb Refactor hoodie-common and create right abstractions for Hoodie Storage V2.0
The following is the gist of changes done

- All low-level operation of creating a commit code was in HoodieClient which made it hard to share code if there was a compaction commit.
- HoodieTableMetadata contained a mix of metadata and filtering files. (Also few operations required FileSystem to be passed in because those were called from TaskExecutors and others had FileSystem as a global variable). Since merge-on-read requires a lot of that code, but will have to change slightly on how it operates on the metadata and how it filters the files. The two set of operation are split into HoodieTableMetaClient and TableFileSystemView.
- Everything (active commits, archived commits, cleaner log, save point log and in future delta and compaction commits) in HoodieTableMetaClient is a HoodieTimeline. Timeline is a series of instants, which has an in-built concept of inflight and completed commit markers.
- A timeline can be queries for ranges, contains and also use to create new datapoint (create a new commit etc). Commit (and all the above metadata) creation/deletion is streamlined in a timeline
- Multiple timelines can be merged into a single timeline, giving us an audit timeline to whatever happened in a hoodie dataset. This also helps with #55.
- Move to java 8 and introduce java 8 succinct syntax in refactored code
2017-02-21 16:23:53 -08:00
Prasanna Rajaperumal
283269e57f [maven-release-plugin] prepare for next development iteration 2017-02-20 16:52:25 -08:00
Prasanna Rajaperumal
d5a5f2ddff [maven-release-plugin] prepare release hoodie-0.3.0 2017-02-20 16:52:04 -08:00
Prasanna Rajaperumal
be1dd9444f [maven-release-plugin] prepare for next development iteration 2017-02-20 16:09:05 -08:00
Prasanna Rajaperumal
47583e280f [maven-release-plugin] prepare release hoodie-0.2.14 2017-02-20 16:08:45 -08:00
Prasanna Rajaperumal
2d49711cce Changing the current development version to 0.2.14-SNAPSHOT 2017-02-20 16:01:24 -08:00
Prasanna Rajaperumal
4a47d26818 Fixing a javadoc lint issue 2017-02-20 15:57:58 -08:00
Prasanna Rajaperumal
cc58a4c3e0 [maven-release-plugin] prepare for next development iteration 2017-02-20 15:49:45 -08:00
Prasanna Rajaperumal
dd03038254 [maven-release-plugin] prepare release hoodie-0.2.13 2017-02-20 15:49:20 -08:00
Prasanna Rajaperumal
57a0b7a781 [maven-release-plugin] prepare for next development iteration 2017-02-20 15:35:19 -08:00
Prasanna Rajaperumal
9828bd8019 [maven-release-plugin] prepare release hoodie-0.2.12 2017-02-20 15:35:03 -08:00
Prasanna Rajaperumal
8f12163166 [maven-release-plugin] prepare for next development iteration 2017-02-20 15:00:35 -08:00
Prasanna Rajaperumal
6e6f6efb94 [maven-release-plugin] prepare release hoodie-0.2.11 2017-02-20 15:00:16 -08:00
vinoth chandar
54409b07ea Implement HoodieROTablePathFilter for use with Spark 2.0 (#66)
- Unit tested
 - Tested with few queries on Spark 2.x at Uber
2017-01-26 11:13:33 -08:00
Prasanna Rajaperumal
693d751506 [maven-release-plugin] prepare for next development iteration 2017-01-10 22:27:35 -08:00
Prasanna Rajaperumal
e9866bb4d9 [maven-release-plugin] prepare release hoodie-0.2.10 2017-01-10 22:27:28 -08:00
Prasanna Rajaperumal
1ced46ab3e [maven-release-plugin] prepare for next development iteration 2017-01-05 20:04:35 -08:00
Prasanna Rajaperumal
e9f0d4d0bf [maven-release-plugin] prepare release hoodie-0.2.9 2017-01-05 20:04:28 -08:00
Prasanna Rajaperumal
7171ea6909 [maven-release-plugin] prepare for next development iteration 2017-01-05 19:43:31 -08:00
Prasanna Rajaperumal
c1f2d1e456 [maven-release-plugin] prepare release hoodie-0.2.8 2017-01-05 19:43:25 -08:00
Prasanna Rajaperumal
2475610246 Build hoodie-hadoop-mr as a fat jar - include hoodie common 2017-01-04 14:52:38 -08:00
Vinoth Chandar
dd8638c2cc Annotate HoodieInputFormat with UseFileSplitsFromInputFormat
- Tested with corresponding Presto changes (pending PR)
2017-01-04 11:19:25 -08:00
Prasanna Rajaperumal
ee854da9b2 [maven-release-plugin] prepare for next development iteration 2016-12-21 17:38:57 -08:00
Prasanna Rajaperumal
587aaffcf6 [maven-release-plugin] prepare release hoodie-0.2.7 2016-12-21 17:38:50 -08:00
Prasanna Rajaperumal
12885ab60a [maven-release-plugin] prepare for next development iteration 2016-12-21 16:48:36 -08:00
Prasanna Rajaperumal
d42f235bbc [maven-release-plugin] prepare release hoodie-0.2.6 2016-12-21 16:48:30 -08:00
Prasanna Rajaperumal
5f9be58576 [maven-release-plugin] prepare for next development iteration 2016-12-21 14:45:37 -08:00
Prasanna Rajaperumal
5362e92886 [maven-release-plugin] prepare release hoodie-0.2.5 2016-12-21 14:45:31 -08:00
Prasanna Rajaperumal
5d44ae3dbd Make hoodie run on travis-ci
Test logs > 4MB, which is a limit for travis-ci. Reducing the logs by setting appropriate log levels for tests
Add sudo: required on travis.yml to get more memory for running the tests. (https://github.com/travis-ci/travis-ci/issues/5926)
Fixed requirement that fsclient.lastDataFileForDataset always returns files in order
2016-12-20 19:26:48 -08:00
Prasanna Rajaperumal
388457b6b2 Add hoodie-hive module to support hive registration of hoodie datasets 2016-12-19 23:05:39 -08:00
Prasanna Rajaperumal
61200b1207 Adding hoodie-hadoop-mr module to add HoodieInputFormat 2016-12-16 19:29:53 -08:00