Go to file

Balaji Varadarajan 1b61f04e05 (1) Define CompactionWorkload in avro to allow storing them in instant files.

(2) Split APIs in HoodieRealtimeCompactor to separate generating compaction workload from running compaction

2018-08-07 08:19:50 -07:00

deploy

Add ossrh profile to publish maven artifacts to oss.sonatype.org (synced with maven central)

2016-12-21 14:17:35 -08:00

docs

Fixing deps & serialization for RTView

2018-06-10 19:16:44 -07:00

hoodie-cli

[maven-release-plugin] prepare for next development iteration

2018-06-11 08:59:13 -07:00

hoodie-client

(1) Define CompactionWorkload in avro to allow storing them in instant files.

2018-08-07 08:19:50 -07:00

hoodie-common

(1) Define CompactionWorkload in avro to allow storing them in instant files.

2018-08-07 08:19:50 -07:00

hoodie-hadoop-mr

FileSystemView and Timeline level changes to support Async Compaction

2018-08-07 08:19:50 -07:00

hoodie-hive

[maven-release-plugin] prepare for next development iteration

2018-06-11 08:59:13 -07:00

hoodie-spark

Adding ability for inserts to be written to log files

2018-06-11 14:08:59 -07:00

hoodie-utilities

[maven-release-plugin] prepare for next development iteration

2018-06-11 08:59:13 -07:00

style

CodeStyle formatting to conform to basic Checkstyle rules.

2018-03-30 11:09:40 -07:00

_config.yml

Set theme jekyll-theme-minimal

2016-12-29 16:53:39 -08:00

.gitignore

Importing Hoodie Client from internal repo

2016-12-16 14:34:42 -08:00

.travis.yml

Update java version to 8 in travis.yml

2017-05-17 13:43:11 -07:00

CHANGELOG.md

Added CHANGELOG.md and updated community contributions guideline

2017-06-16 10:48:37 -07:00

LICENSE.txt

Importing Hoodie Client from internal repo

2016-12-16 14:34:42 -08:00

pom.xml

[maven-release-plugin] prepare for next development iteration

2018-06-11 08:59:13 -07:00

README.md

Update README.md

2017-12-10 07:50:37 -08:00

RELEASE_NOTES.md

Update Release notes for 0.4.2 release

2018-06-11 08:41:11 -07:00

README.md

Hudi

Hudi (pronounced Hoodie) stands for Hadoop Upserts anD Incrementals. Hudi manages storage of large analytical datasets on HDFS and serve them out via two types of tables

Read Optimized Table - Provides excellent query performance via purely columnar storage (e.g. Parquet)
Near-Real time Table (WIP) - Provides queries on real-time data, using a combination of columnar & row based storage (e.g Parquet + Avro)

For more, head over here

Languages

Java 81.4%

Scala 16.7%

ANTLR 0.9%

Shell 0.8%

Dockerfile 0.2%