Go to file

vinoth chandar 7a973a6944 [HUDI-159] Redesigning bundles for lighter-weight integrations

- Documented principles applied for redesign at packaging/README.md
 - No longer depends on incl commons-codec, commons-io, commons-pool, commons-dbcp, commons-lang, commons-logging, avro-mapred
 - Introduce new FileIOUtils & added checkstyle rule for illegal import of above
 - Parquet, Avro dependencies moved to provided scope to enable being picked up from Hive/Spark/Presto instead
 - Pickup jackson jars for Hive sync tool from HIVE_HOME & unbundling jackson everywhere
 - Remove hive-jdbc standalone jar from being bundled in Spark/Hive/Utilities bundles
 - 6.5x reduced number of classes across bundles

2019-09-11 11:08:27 -07:00

deploy

[HUDI-230] Add missing Apache License in some files

2019-08-30 09:38:28 -07:00

docker

[HUDI-230] Add missing Apache License in some files

2019-08-30 09:38:28 -07:00

hudi-cli

[HUDI-159] Redesigning bundles for lighter-weight integrations

2019-09-11 11:08:27 -07:00

hudi-client

[HUDI-159] Redesigning bundles for lighter-weight integrations

2019-09-11 11:08:27 -07:00

hudi-common

[HUDI-159] Redesigning bundles for lighter-weight integrations

2019-09-11 11:08:27 -07:00

hudi-hadoop-mr

[HUDI-159] Redesigning bundles for lighter-weight integrations

2019-09-11 11:08:27 -07:00

hudi-hive

[HUDI-159] Redesigning bundles for lighter-weight integrations

2019-09-11 11:08:27 -07:00

hudi-integ-test

[HUDI-159] Redesigning bundles for lighter-weight integrations

2019-09-11 11:08:27 -07:00

hudi-spark

Fix logging in HoodieSparkSqlWriter

2019-09-07 07:51:11 -07:00

hudi-timeline-service

[HUDI-225] Create Hudi Timeline Server Fat Jar

2019-08-29 20:03:06 -07:00

hudi-utilities

[HUDI-159] Redesigning bundles for lighter-weight integrations

2019-09-11 11:08:27 -07:00

packaging

[HUDI-159] Redesigning bundles for lighter-weight integrations

2019-09-11 11:08:27 -07:00

release/config

[HUDI-230] Add missing Apache License in some files

2019-08-30 09:38:28 -07:00

style

[HUDI-159] Redesigning bundles for lighter-weight integrations

2019-09-11 11:08:27 -07:00

tools

[HUDI-230] Add missing Apache License in some files

2019-08-30 09:38:28 -07:00

_config.yml

[HUDI-230] Add missing Apache License in some files

2019-08-30 09:38:28 -07:00

.gitignore

[HUDI-68] Pom cleanup & demo automation (#846 )

2019-08-22 20:18:50 -07:00

.travis.yml

[HUDI-230] Add missing Apache License in some files

2019-08-30 09:38:28 -07:00

CHANGELOG.md

Added CHANGELOG.md and updated community contributions guideline

2017-06-16 10:48:37 -07:00

KEYS

Adding GPG Keys

2019-08-12 12:49:10 -07:00

LICENSE.txt

Importing Hoodie Client from internal repo

2016-12-16 14:34:42 -08:00

NOTICE.txt

[HUDI-225] Create Hudi Timeline Server Fat Jar

2019-08-29 20:03:06 -07:00

pom.xml

[HUDI-159] Redesigning bundles for lighter-weight integrations

2019-09-11 11:08:27 -07:00

README.md

[HUDI-181] Fix the Bold markdown grammar issue of README file (#808 )

2019-07-30 03:47:53 -07:00

RELEASE_NOTES.md

Release notes for 0.4.7

2019-05-28 18:28:59 -07:00

README.md

Hudi

Apache Hudi (pronounced Hoodie) stands for Hadoop Upserts anD Incrementals. Hudi manages storage of large analytical datasets on DFS (Cloud stores, HDFS or any Hadoop FileSystem compatible storage) and provide ability to query them via three types of views

Read Optimized View - Provides excellent query performance via purely columnar storage (e.g. Parquet)
Incremental View - Provides a change stream with records inserted or updated after a point in time.
Real time View - Provides queries on real-time data, using a combination of columnar & row based storage (e.g Parquet + Avro)

For more, head over here

Languages

Java 81.4%

Scala 16.7%

ANTLR 0.9%

Shell 0.8%

Dockerfile 0.2%