Commit Graph

563 Commits

Author SHA1 Message Date
Balaji Varadarajan
6da2f9ac7c [HUDI-287] Address comments during review of release candidate
1. Remove LICENSE and NOTICE files in hoodie child modules.
2. Remove developers and contributor section from pom
3. Also ensure any failures in validation script are reported appropriately
4. Make hoodie parent pom consistent with that of its parent apache-21 (https://github.com/apache/maven-apache-parent/blob/apache-21/pom.xml)
2019-10-03 09:00:07 -07:00
Balaji Varadarajan
e41835fd91 [HUDI-121] Fix bugs in Release Scripts found during RC creation 2019-10-01 21:34:24 -07:00
Balaji Varadarajan
1d2e3dcc65 [HUDI-121] Update Release notes 2019-09-30 16:24:45 -07:00
Balaji Varadarajan
6e8a28bcae HUDI-121 : Address comments during RC2 voting
1. Remove dnl utils jar from git
2. Add LICENSE Headers in missing files
3. Fix NOTICE and LICENSE in all HUDI packages and in top-level
4. Fix License wording in certain HUDI source files
5. Include non java/scala code in RAT licensing check
6. Use whitelist to include dependencies as part of timeline-server bundling
2019-09-30 15:42:15 -07:00
Bhavani Sudha Saktheeswaran
50a073ff57 [HUDI-271] Create QuickstartUtils for simplifying quickstart guide
- This will be used in Quickstart guide (Doc changes to follow in a separate PR). The intention is to simplify quickstart to showcase hudi APIs by writing and reading using spark datasources.
- This is located in the hudi-spark module intentionally, so that all the necessary classes end up in hudi-spark-bundle.
2019-09-30 15:22:18 -07:00
vinoyang
01e803b00e [HUDI-247] Unify the re-initialization of HoodieTableMetaClient in test for hoodie-client module (#930) 2019-09-30 05:38:52 -07:00
Balaji Varadarajan
2ea8b0c3f1 [HUDI-279] Fix regression in Schema Evolution due to PR-755 2019-09-25 22:53:43 -07:00
Xing Pan
bf05f95413 [HUDI-269] Limit sync frequency (#921)
* [HUDI-269] Throttle DeltaStreamer sync runs
2019-09-24 05:30:35 -07:00
vinoth chandar
1e6dd1ca1d [MINOR] : Further updates to .mailmap for vinoth (#919) 2019-09-22 17:26:04 -07:00
PanXing
635154c439 [MINOR] support reading cfg file in another s3 bucket (#914) 2019-09-22 06:47:23 -07:00
vinoyang
f020d029c4 HUDI-267 Refactor bad method name HoodieTestUtils#initTableType and HoodieTableMetaClient#initializePathAsHoodieDataset (#916) 2019-09-21 09:05:02 -07:00
Nishith Agarwal
1104f9526f HUDI-180 : Adding support for hive registration using metastore along with JDBC 2019-09-21 08:04:34 -07:00
Balaji Varadarajan
69ca45b2da Fix mailmap to allow mapping of defunct emailIds to active emailIds 2019-09-19 00:17:56 -07:00
Balaji Varadarajan
2e75a6973d Adding mailmap to allow mapping of defunct emailIds to active emailIds 2019-09-19 00:12:20 -07:00
Balaji Varadarajan
4bda742a93 Updating release notes and preparing for 0.5.0-incubating-rc2 release 2019-09-17 12:41:54 -07:00
Vinoth Chandar
e217db56ab [HUDI-254]: Bundle and shade databricks/avro with spark bundle
- From Spark 2.4 onwards, Spark has built-in support; shading avoids conflicts
- Spark 2.3 still needs this bundled, so that dropping the bundle into the jars folder works
2019-09-17 12:38:51 -07:00
Balaji Varadarajan
c1e7d0e5a6 [HUDI-121] Update Release notes and fix master version 2019-09-17 09:50:30 -07:00
Balaji Varadarajan
2c6da09d9d [HUDI-257] Fix Bloom Index unit-test failures 2019-09-17 09:41:15 -07:00
Balaji Varadarajan
96a46d8701 [HUDI-252] Identify packages which are under non-apache license in LICENSE file 2019-09-16 21:56:12 -07:00
Balaji Varadarajan
629698d54f [HUDI-252] Add Disclaimer and cleanup NOTICE and LICENSE files in hudi 2019-09-16 13:08:26 -07:00
Taher Koitawala
c0f42afa35 [HUDI-62] Index Lookup Timer added to HoodieWriteClient 2019-09-16 11:12:52 -07:00
Balaji Varadarajan
63cc455d9c [HUDI-121] ASF Release : Ensure NOTICE.txt is generated as part of cutting release branch 2019-09-13 13:58:29 -07:00
Balaji Varadarajan
7190c022bb [HUDI-249] Updating Notice files 2019-09-13 13:50:58 -07:00
Balaji Varadarajan
1a61d6bc4d [HUDI-249] Fix RELEASE-NOTES to match correct state 2019-09-13 12:48:38 -07:00
Vinoth Chandar
b1446be2b4 [HUDI-244] : Hive sync should escape partition field name
- now supports field names beginning with '_', for example
2019-09-13 12:41:52 -07:00
Balaji Varadarajan
3ee16b5439 [HUDI-250] Ensure Hudi CLI wrapper works with non snapshot jars too. Also Fix bug in cut_release_branch script 2019-09-13 12:40:32 -07:00
Nishith Agarwal
0b032b2761 Fix requested compaction rollback during restore command 2019-09-13 12:40:13 -07:00
Balaji Varadarajan
d2525c31b7 Moving to 0.6.0-SNAPSHOT on master branch. 2019-09-13 09:58:29 -07:00
Balaji Varadarajan
36c74c8a16 [HUDI-249] Fix cut_release_branch script 2019-09-13 09:54:39 -07:00
Balaji Varadarajan
117a68d503 Add missing commit in release notes 2019-09-13 09:45:19 -07:00
Balaji Varadarajan
58623631d4 [HUDI-249] Update Release-notes. Add sign-artifacts to POM and release related scripts. Add missing license headers 2019-09-13 08:41:29 -07:00
yanghua
895d732a14 refactor code 2019-09-12 05:15:07 -07:00
yanghua
5f04241fce refactor code: add docs and init/cleanup resource group for hoodie client test base 2019-09-12 05:15:07 -07:00
yanghua
80c27f2351 Optimize hoodie client after implementing AutoCloseable interface 2019-09-12 05:15:07 -07:00
yanghua
90bfb900aa revert setting jsc spark configuration 2019-09-12 05:15:07 -07:00
yanghua
6f2b166005 [HUDI-217] Provide a unified resource management class to standardize the resource allocation and release for hudi client test cases 2019-09-12 05:15:07 -07:00
Bhavani Sudha Saktheeswaran
64df98fc4a [HUDI-164] Fixes incorrect averageBytesPerRecord
When the number of records written is zero, averageBytesPerRecord evaluates to a huge size (division by zero, ceiled to Long.MAX_VALUE), causing OOM. This commit fixes the issue by traversing the commits in reverse until a more reasonable average record size can be computed; if that is not possible, it returns the default configured record size.
2019-09-11 15:20:25 -07:00
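The fallback described in the [HUDI-164] message above can be illustrated with a minimal sketch. This is not the Hudi implementation: `CommitStats`, `average_bytes_per_record`, and the `default_record_size` parameter are hypothetical names chosen for illustration, and "reasonable" is assumed here to mean simply that a commit wrote a positive number of bytes and records.

```python
import math
from dataclasses import dataclass
from typing import Sequence


@dataclass
class CommitStats:
    """Hypothetical per-commit summary (not a Hudi class)."""
    bytes_written: int
    records_written: int


def average_bytes_per_record(commits_oldest_first: Sequence[CommitStats],
                             default_record_size: int) -> int:
    """Walk commits from newest to oldest and return the first usable average.

    A commit is skipped when it wrote zero records or zero bytes, which
    avoids the division by zero that previously produced a huge estimate.
    If no commit is usable, the configured default is returned.
    """
    for stats in reversed(commits_oldest_first):
        if stats.records_written > 0 and stats.bytes_written > 0:
            return math.ceil(stats.bytes_written / stats.records_written)
    return default_record_size


if __name__ == "__main__":
    history = [CommitStats(1000, 10), CommitStats(0, 0)]
    # The newest commit is empty, so the estimate falls back to the older one.
    print(average_bytes_per_record(history, default_record_size=1024))
```

Under these assumptions an empty latest commit no longer distorts the size estimate, and a table with no usable history falls back to the default.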
Balaji Varadarajan
93bc5e2153 HUDI-243 Rename HoodieInputFormat and HoodieRealtimeInputFormat to HoodieParquetInputFormat and HoodieParquetRealtimeInputFormat 2019-09-11 14:03:01 -07:00
Vinoth Chandar
d0b9b56b7d [HUDI-143] Excluding javax.* from utilities and spark bundles
- Plus minor code review comments
2019-09-11 11:08:27 -07:00
vinoth chandar
7a973a6944 [HUDI-159] Redesigning bundles for lighter-weight integrations
- Documented principles applied for redesign at packaging/README.md
- No longer depends on incl commons-codec, commons-io, commons-pool, commons-dbcp, commons-lang, commons-logging, avro-mapred
- Introduce new FileIOUtils & added checkstyle rule for illegal import of above
- Parquet, Avro dependencies moved to provided scope to enable being picked up from Hive/Spark/Presto instead
- Pickup jackson jars for Hive sync tool from HIVE_HOME & unbundling jackson everywhere
- Remove hive-jdbc standalone jar from being bundled in Spark/Hive/Utilities bundles
- 6.5x reduced number of classes across bundles
2019-09-11 11:08:27 -07:00
Mehrotra
0e6f078ec4 Fix logging in HoodieSparkSqlWriter 2019-09-07 07:51:11 -07:00
leesf
07a0ea87ab [hotfix] fix typo 2019-09-06 08:31:30 -07:00
leesf
821e0dcffc [HUDI-236] Failed to close stream 2019-09-03 19:24:11 -07:00
Alex Filipchik
555dd55c16 Support nested ordering fields 2019-08-30 13:41:16 -07:00
leesf
8b150a3c6b [HUDI-230] Add missing Apache License in some files 2019-08-30 09:38:28 -07:00
Balaji Varadarajan
376b59ae5f [HUDI-227] : DeltaStreamer Improvements : Commit empty input batch with progressing checkpoints and allow users to override configs through properties. Original PR : PR-805 and PR-806 (#863) 2019-08-30 09:13:34 -07:00
Balaji Varadarajan
a6908ef44d HUDI-170 Updating hoodie record before inserting it into ExternalSpillableMap (#866) 2019-08-30 09:03:37 -07:00
leesf
40dd4dd637 [HUDI-229] Fix mvn notice:generate issue in windows 2019-08-30 00:16:24 -07:00
leesf
5c2da6051e [HUDI-225] Create Hudi Timeline Server Fat Jar 2019-08-29 20:03:06 -07:00
Balaji Varadarajan
5f9fa82f47 HUDI-124 : Exclude jdk.tools from hadoop-common and update Notice files (#858) 2019-08-28 16:20:47 -07:00