lamber-ken
2745b7552f
[HUDI-379] Refactor the codes based on new JavadocStyle code style rule ( #1079 )
2019-12-06 12:59:28 +08:00
lamber-ken
c06d89b648
[HUDI-378] Refactor the rest codes based on new ImportOrder code style rule ( #1078 )
2019-12-05 17:25:03 +08:00
lamber-ken
b3e0ebbc4a
[checkstyle] Add ConstantName java checkstyle rule ( #1066 )
...
* add SimplifyBooleanExpression java checkstyle rule
* collapse empty tags in scalastyle file
2019-12-04 18:59:15 +08:00
Gurudatt Kulkarni
b2d9638bea
[HUDI-365] Refactor hudi-cli based on new ImportOrder code style rule ( #1076 )
2019-12-04 15:10:40 +08:00
vinoyang
84602c8882
[HUDI-355] Refactor hudi-common based on new comment and code style rules ( #1049 )
...
[HUDI-355] Refactor hudi-common based on new comment and code style rules
2019-12-03 20:49:13 -08:00
leesf
98ab33bb6e
[HUDI-294] Delete Paths written in Cleaner plan needs to be relative to partition-path ( #1062 )
...
[HUDI-294] Delete Paths written in Cleaner plan needs to be relative to partition-path
2019-12-03 10:11:03 -08:00
leesf
845e261658
[MINOR] Update some urls from http to https in the README file ( #1074 )
2019-12-03 17:35:27 +08:00
lamber-ken
89f0968097
[DOCS] Update the build source link ( #1071 )
2019-12-03 09:56:09 +08:00
lamber-ken
ff688107fa
[DOCS] Update Hudi Readme ( #1058 )
...
- Add build status
- Clean up layout
2019-12-02 09:25:43 -08:00
lamber-ken
784e3ad0b6
[HUDI-370] Refactor hudi-common based on new ImportOrder code style rule ( #1063 )
2019-12-02 06:59:09 +08:00
Gurudatt Kulkarni
75132c139f
[HUDI-357] Refactor hudi-cli based on new comment and code style rules ( #1051 )
2019-11-30 11:12:41 -08:00
hongdd
b65a897856
[HUDI-374] Unable to generateUpdates in QuickstartUtils ( #1059 )
2019-11-30 11:11:00 -08:00
lamber-ken
024230fbd2
[HUDI-372] Support the shortName for Hudi DataSource ( #1054 )
...
- Ability to do `spark.write.format("hudi")...`
2019-11-30 08:02:33 -08:00
ForwardXu
0b52ae3ac2
[HUDI-209] Implement JMX metrics reporter ( #1045 )
2019-11-28 19:17:34 +08:00
lamber-ken
da8d1334ee
[HUDI-373] Refactor hudi-client based on new ImportOrder code style rule ( #1056 )
2019-11-28 09:25:56 +08:00
谢磊
f9139c0f61
[HUDI-366] Refactor some module codes based on new ImportOrder code style rule ( #1055 )
...
[HUDI-366] Refactor hudi-hadoop-mr / hudi-timeline-service / hudi-spark / hudi-integ-test / hudi- utilities based on new ImportOrder code style rule
2019-11-27 21:32:43 +08:00
谢磊
b77fad39b5
[HUDI-364] Refactor hudi-hive based on new ImportOrder code style rule ( #1048 )
...
[HUDI-364] Refactor hudi-hive based on new ImportOrder code style rule
2019-11-27 16:30:37 +08:00
wenningd
d6e83e8f49
[HUDI-325] Fix Hive partition error for updated HDFS Hudi table ( #1001 )
2019-11-26 21:18:39 -08:00
bschell
60fed21dc7
[HUDI-327] Add null/empty checks to key generators ( #1040 )
...
* Adds null and empty checks to all key generators.
* Also improves error messaging for key generator issues.
2019-11-26 02:37:16 -08:00
Pratyaksh Sharma
2a4cfb47c7
[HUDI-340]: made max events to read from kafka source configurable ( #1039 )
2019-11-26 18:34:02 +08:00
hongdd
a7e07cd910
[HUDI-359] Add hudi-env for hudi-cli module ( #1042 )
2019-11-25 13:25:42 -08:00
谢磊
212282c8aa
[HUDI-358] Add Java-doc and importOrder checkstyle rule ( #1043 )
...
- import groups are separated by one blank line
- org.apache.hudi.* at the top location
2019-11-25 11:36:23 -08:00
hongdd
44823041a3
[HUDI-362] Adds a check for the existence of field ( #1047 )
2019-11-25 11:31:07 -08:00
filippo balicchia
845a0509b3
[MINOR] Some minor optimizations in HoodieJavaStreamingApp ( #1046 )
2019-11-25 18:49:13 +08:00
Sivabalan Narayanan
c3355109b1
[HUDI-328] Adding delete api to HoodieWriteClient ( #1004 )
...
[HUDI-328] Adding delete api to HoodieWriteClient and Spark DataSource
2019-11-22 15:05:25 -08:00
hongdd
7bc08cbfdc
[HUDI-345] Fix used deprecated function ( #1024 )
...
- Schema.parse() with new Schema.Parser().parse
- FSDataOutputStream constructor
2019-11-22 03:32:09 -08:00
Gurudatt Kulkarni
17eaf41c54
[HUDI-348] Add Issue template for the project ( #1029 )
2019-11-22 03:25:52 -08:00
Pratyaksh Sharma
1e14390719
[HUDI-350]: updated default value of config.getCleanerCommitsRetained() in javadocs
2019-11-20 06:50:04 -08:00
lamber-ken
d9fbe33339
[HOTFIX] Fix error configuration item of dockerfile-maven-plugin
2019-11-19 16:30:03 -08:00
谢磊
804e348d0e
[HUDI-346] Set allowMultipleEmptyLines to false for EmptyLineSeparator rule ( #1025 )
2019-11-19 18:44:42 +08:00
谢磊
66492498f7
[HUDI-342] add pull request template for hudi project ( #1022 )
2019-11-19 14:06:01 +08:00
b_rousseau
e806eb797f
[HUDI-339] Add support of Azure cloud storage ( #1019 )
...
- Add Azure WASB (BLOB) and ADLS storage in StorageSchemes enum
- Update testStorageSchemes to test new added storage
2019-11-17 14:29:24 -08:00
Nishith Agarwal
f82e58994e
- Ensure that rollback instant is always created before the next commit instant.
...
This especially affects IncrementalPull for MOR tables since we can end up pulling in
log blocks for uncommitted data
- Ensure that generated commit instants are 1 second apart
2019-11-17 14:11:26 -08:00
Nishith Agarwal
3a05edab01
- Fixing RT queries for HiveOnSpark that causes race conditions
...
- Adding more comments to understand usage of reader/writer schema
2019-11-16 13:46:47 -08:00
谢磊
22315a887f
[HOTFIX] fix missing version of rat-plugin ( #1015 )
2019-11-13 21:18:46 -08:00
Balaji Varadarajan
f7c2f8cedc
[HUDI-329] Presto Containers for integration test must allow newly built local jars to override
2019-11-13 17:35:34 -08:00
Mehrotra
92c69f5703
Migrate integration tests to spark 2.4.4
2019-11-13 16:53:24 -08:00
lamber-ken
045fa87a3d
[HUDI-330] add EmptyStatement java checkstyle rule
2019-11-13 14:11:11 -08:00
Balaji Varadarajan
8ff06ddb0f
[HUDI-80] Leverage Commit metadata to figure out partitions to be cleaned for Cleaning by commits mode ( #1008 )
2019-11-12 06:12:44 -08:00
Udit Mehrotra
0bb5999f79
[HUDI-306] Support Glue catalog and other hive metastore implementations ( #961 )
...
- Support Glue catalog and other metastore implementations
- Remove shading from hudi utilities bundle
- Add maven profile to optionally shade hive in utilities bundle
2019-11-11 17:27:31 -08:00
Balaji Varadarajan
1032fc3e54
[HUDI-137] Hudi cleaning state changes should be consistent with compaction actions
...
Before this change, Cleaner performs cleaning of old file versions and then stores the deleted files in .clean files.
With this setup, we will not be able to track file deletions if a cleaner fails after deleting files but before writing .clean metadata.
This is fine for regular file-system view generation but Incremental timeline syncing relies on clean/commit/compaction metadata to keep a consistent file-system view.
Cleaner state transitions is now similar to that of compaction.
1. Requested : HoodieWriteClient.scheduleClean() selects the list of files that needs to be deleted and stores them in metadata
2. Inflight : HoodieWriteClient marks the state to be inflight before it starts deleting
3. Completed : HoodieWriteClient marks the state after completing the deletion according to the cleaner plan
2019-11-11 10:40:16 -08:00
Sivabalan Narayanan
23b303e4b1
[HUDI-218] Adding Presto support to Integration Test ( #1003 )
2019-11-11 06:21:49 -08:00
Pratyaksh Sharma
5f1309407a
[HUDI-253]: added validations for schema provider class ( #995 )
2019-11-11 06:03:44 -08:00
vinoth chandar
1483b97018
[DOCS] Change Hudi acronyms to plural
2019-11-10 12:39:58 -08:00
Jeff G
1ce3d891ce
[DOCS] Update to align with original Uber whitepaper ( #999 )
2019-11-10 12:38:13 -08:00
pratyakshsharma
0863b1cfd9
[HUDI-245]: replaced instances of getInstants() and reverse() with getReverseOrderedInstants() ( #1000 )
2019-11-07 08:42:48 -08:00
pratyakshsharma
20871a17b2
[HUDI-302]: simplified countInstants() method in HoodieDefaultTimeline ( #997 )
2019-11-06 12:56:09 -08:00
Gurudatt Kulkarni
71ac2c0d5e
[HUDI-324] TimestampKeyGenerator should support milliseconds ( #993 )
2019-11-05 04:22:14 -08:00
Bhavani Sudha Saktheeswaran
04834817c8
[MINOR] Add features and instructions to build Hudi in README ( #992 )
2019-11-03 01:48:06 -08:00
Raymond Xu
91740635b2
[HUDI-321] Support bulkinsert in HDFSParquetImporter ( #987 )
...
- Add bulk insert feature
- Fix some minor issues
2019-11-02 23:12:44 -07:00