hudi/hoodie-spark/pom.xml at 8ad8030f2a0b9da3cd3cdecc21992803ce8a37fc

Files

Vinoth Chandar 85dd265b7b Improving out of box experience for data source

- Fixes #246
 - Bump up default parallelism to 1500, to handle large upserts
 - Add docs on s3 confuration & tuning tips with tested spark knobs
 - Fix bug to not duplicate hoodie metadata fields when input dataframe is another hoodie dataset
 - Improve speed of ROTablePathFilter by removing directory check
 - Move to spark-avro 4.0 to handle issue with nested fields with same name
 - Keep AvroConversionUtils in sync with spark-avro 4.0

2018-06-10 19:16:44 -07:00

6.7 KiB

Raw Blame History

View Raw

6.7 KiB Raw Blame History

6.7 KiB

Raw Blame History