Ho Tien Vu e48e35385a Added preemptive check for 'spark.scheduler.mode'

When running docker demo, NoSuchElementException was thrown because spark.scheduler.mode is not set.
Also we want to check before initializing the Spark Context to avoid polute the SparkConf
with unused config.

2019-06-25 13:39:41 -07:00

deploy

HUDI-125 : Change License for all source files and update RAT configurations

2019-06-09 11:41:55 -07:00

docker

HUDI-125 : Change License for all source files and update RAT configurations

2019-06-09 11:41:55 -07:00

hoodie-cli

HUDI-125 : Change License for all source files and update RAT configurations

2019-06-09 11:41:55 -07:00

hoodie-client

Adding support for optional skipping single archiving failures

2019-06-20 22:54:45 -07:00

hoodie-common

Add maprfs to storage schemes

2019-06-20 22:45:35 -07:00

hoodie-hadoop-mr

- Ugrading to Hive 2.x

2019-06-13 12:46:14 -07:00

hoodie-hive

- Ugrading to Hive 2.x

2019-06-13 12:46:14 -07:00

hoodie-integ-test

HUDI-125 : Change License for all source files and update RAT configurations

2019-06-09 11:41:55 -07:00

hoodie-spark

adding support for complex keys (#728 )

2019-06-21 00:25:06 -07:00

hoodie-timeline-service

Ensure TableMetaClient and FileSystem instances have exclusive copy of Configuration

2019-06-20 14:05:00 -07:00

hoodie-utilities

Added preemptive check for 'spark.scheduler.mode'

2019-06-25 13:39:41 -07:00

packaging

HUDI-70 : Making DeltaStreamer run in continuous mode with concurrent compaction

2019-06-18 17:48:14 -07:00

release/config

- Ugrading to Hive 2.x

2019-06-13 12:46:14 -07:00

style

General enhancements

2018-12-18 12:52:39 -08:00

_config.yml

Set theme jekyll-theme-minimal

2016-12-29 16:53:39 -08:00

.gitignore

Bucketized Bloom Filter checking

2019-05-11 16:38:28 -07:00

.travis.yml

Auto generated Slack Channel Notifications setup

2019-06-07 06:46:00 -07:00

CHANGELOG.md

Added CHANGELOG.md and updated community contributions guideline

2017-06-16 10:48:37 -07:00

KEYS

HUDI-75: Add KEYS

2019-03-18 07:46:25 -07:00

LICENSE.txt

Importing Hoodie Client from internal repo

2016-12-16 14:34:42 -08:00

NOTICE.txt

Changes related to Licensing work

2019-06-07 17:58:57 -07:00

pom.xml

HUDI-70 : Making DeltaStreamer run in continuous mode with concurrent compaction

2019-06-18 17:48:14 -07:00

README.md

Update README.md

2019-06-17 18:19:34 -07:00

RELEASE_NOTES.md

Release notes for 0.4.7

2019-05-28 18:28:59 -07:00

README.md

Hudi

Apache Hudi (pronounced Hoodie) stands for Hadoop Upserts anD Incrementals. Hudi manages storage of large analytical datasets on DFS (cloud stores, HDFS, or any Hadoop FileSystem-compatible storage) and provides the ability to query them via three types of views:

* **Read Optimized View** - Provides excellent query performance via purely columnar storage (e.g. Parquet)
* **Incremental View** - Provides a change stream with records inserted or updated after a point in time.
* **Real time View** - Provides queries on real-time data, using a combination of columnar and row-based storage (e.g. Parquet + Avro)
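
To make the difference between the three views concrete, here is a minimal in-memory sketch. It is a toy model and not the Hudi API: the class `ViewsSketch`, the `Row` type, and the `base`/`log` lists are all hypothetical names introduced for illustration, with `base` standing in for compacted columnar files and `log` for newer row-based updates that have not been compacted yet.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Toy illustration of the three views. NOT the Hudi API; all names are hypothetical.
public class ViewsSketch {

    // One version of a record: its key, its value, and the commit time that wrote it.
    static final class Row {
        final String key;
        final String value;
        final long commitTime;

        Row(String key, String value, long commitTime) {
            this.key = key;
            this.value = value;
            this.commitTime = commitTime;
        }

        @Override
        public String toString() {
            return key + "=" + value + "@" + commitTime;
        }
    }

    // Read optimized view: only the compacted, columnar base data is visible.
    static Map<String, Row> readOptimized(List<Row> base) {
        Map<String, Row> out = new LinkedHashMap<>();
        for (Row r : base) {
            out.put(r.key, r);
        }
        return out;
    }

    // Real time view: base data with newer row-based log entries merged on top.
    static Map<String, Row> realTime(List<Row> base, List<Row> log) {
        Map<String, Row> out = readOptimized(base);
        for (Row r : log) {
            Row current = out.get(r.key);
            if (current == null || r.commitTime >= current.commitTime) {
                out.put(r.key, r);
            }
        }
        return out;
    }

    // Incremental view: latest records inserted or updated after the given commit time.
    static List<Row> incremental(List<Row> base, List<Row> log, long afterCommit) {
        List<Row> out = new ArrayList<>();
        for (Row r : realTime(base, log).values()) {
            if (r.commitTime > afterCommit) {
                out.add(r);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<Row> base = Arrays.asList(new Row("a", "1", 100), new Row("b", "2", 100));
        List<Row> log = Arrays.asList(new Row("b", "3", 200), new Row("c", "4", 200));

        System.out.println(readOptimized(base).values()); // [a=1@100, b=2@100]
        System.out.println(realTime(base, log).values()); // [a=1@100, b=3@200, c=4@200]
        System.out.println(incremental(base, log, 100));  // [b=3@200, c=4@200]
    }
}
```

In this sketch the read optimized view trades freshness for speed (it never sees commit 200), the real time view pays a merge cost to see the latest data, and the incremental view returns only what changed after the requested commit.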

For more, head over here

Languages

Java 81.4%

Scala 16.7%

ANTLR 0.9%

Shell 0.8%

Dockerfile 0.2%