1
0

Commit Graph

  • ecbd389a3f [HUDI-1478] Introduce HoodieBloomIndex to hudi-java-client (#2608) Shen Hong 2021-03-28 20:28:40 +08:00
  • bec70413c0 [HUDI-1728] Fix MethodNotFound for HiveMetastore Locks (#2731) n3nash 2021-03-27 10:07:10 -07:00
  • 8b774fe331 [HUDI-1495] Bump Flink version to 1.12.2 (#2718) Danny Chan 2021-03-26 14:25:57 +08:00
  • 6e803e08b1 Moving to 0.9.0-SNAPSHOT on master branch. garyli1019 2021-03-24 21:37:14 +08:00
  • 29b79c99b0 [hotfix] Log the error message for creating table source first (#2711) Danny Chan 2021-03-24 18:25:37 +08:00
  • 01a1d7997b [HUDI-1712] Rename & standardize config to match other configs (#2708) n3nash 2021-03-24 02:24:02 -07:00
  • 03668dbaf1 [HUDI-1710] Read optimized query type for Flink batch reader (#2702) Danny Chan 2021-03-24 09:41:30 +08:00
  • 0e6909d3e2 [MINOR][DOCUMENT] Update README doc for integ test (#2703) legendtkl 2021-03-23 20:21:56 +08:00
  • d7b18783bd [HUDI-1709] Improving config names and adding hive metastore uri config (#2699) n3nash 2021-03-22 01:22:06 -07:00
  • ce3e8ec870 [HUDI-1667]: Fix a null value related bug for spark vectorized reader. (#2636) Liulietong 2021-03-20 22:54:20 +08:00
  • 900de34e45 [HUDI-1650] Custom avro kafka deserializer. (#2619) Volodymyr Burenin 2021-03-20 02:51:08 -05:00
  • 161d530f93 Fixing kafka auto.reset.offsets config param key (#2691) Sivabalan Narayanan 2021-03-19 15:54:29 -04:00
  • 55a489c769 [1568] Fixing spark3 bundles (#2625) Sivabalan Narayanan 2021-03-19 14:21:36 -04:00
  • f74828fca1 [HUDI-1705] Flush as per data bucket for mini-batch write (#2695) Danny Chan 2021-03-19 16:30:54 +08:00
  • 1277c62398 [HUDI-1653] Add support for composite keys in NonpartitionedKeyGenerator (#2627) Jintao Guan 2021-03-18 15:33:31 -07:00
  • e602e5dfb9 [MINOR] Remove unused var in AbstractHoodieWriteClient (#2693) wangxianghu 2021-03-19 05:56:02 +08:00
  • d429169ff7 [HUDI-1688]hudi write should uncache rdd, when the write operation is finnished (#2673) xiarixiaoyao 2021-03-19 01:19:18 +08:00
  • f1e0018f12 [HUDI-1704] Use PRIMARY KEY syntax to define record keys for Flink Hudi table (#2694) Danny Chan 2021-03-18 20:21:52 +08:00
  • 968488fa3a [HUDI-1701] Implement HoodieTableSource.explainSource for all kinds of pushing down (#2690) Danny Chan 2021-03-17 23:05:18 +08:00
  • 74241947c1 [HUDI-845] Added locking capability to allow multiple writers (#2374) n3nash 2021-03-16 16:43:53 -07:00
  • b038623ed3 [HUDI 1615] Fixing null schema in bulk_insert row writer path (#2653) Sivabalan Narayanan 2021-03-16 12:44:11 -04:00
  • 16864aee14 [HUDI-1695] Fixed the error messaging (#2679) Vinoth Govindarajan 2021-03-15 20:30:26 -07:00
  • 3b36cb805d [HUDI-1552] Improve performance of key lookups from base file in Metadata Table. (#2494) Prashant Wason 2021-03-15 13:42:57 -07:00
  • 76bf2cc790 [HUDI-1692] Bounded source for stream writer (#2674) Danny Chan 2021-03-15 19:42:36 +08:00
  • fc6c5f4285 [HUDI-1684] Tweak hudi-flink-bundle module pom and reorganize the pacakges for hudi-flink module (#2669) Danny Chan 2021-03-15 16:02:05 +08:00
  • e93c6a5693 [HUDI-1496] Fixing input stream detection of GCS FileSystem (#2500) Sivabalan Narayanan 2021-03-14 04:57:57 -04:00
  • f5e31be086 [HUDI-1685] keep updating current date for every batch (#2671) Ankush Kanungo 2021-03-12 15:53:01 -08:00
  • 20786ab8a2 [HUDI-1681] Support object storage for Flink writer (#2662) Danny Chan 2021-03-12 16:39:24 +08:00
  • e8e6708aea [HUDI-1664] Avro schema inference for Flink SQL table (#2658) Danny Chan 2021-03-11 19:45:48 +08:00
  • 12ff562d2b [HUDI-1678] Row level delete for Flink sink (#2659) Danny Chan 2021-03-11 19:44:06 +08:00
  • 2fdae6835c [HUDI-1663] Streaming read for Flink MOR table (#2640) Danny Chan 2021-03-10 22:44:06 +08:00
  • c4a66324cd [HUDI-1651] Fix archival of requested replacecommit (#2622) satishkotha 2021-03-09 15:56:44 -08:00
  • d8af24d8a2 [HUDI-1635] Improvements to Hudi Test Suite (#2628) Balajee Nagasubramaniam 2021-03-09 13:29:38 -08:00
  • d3a451611c [MINOR] HoodieClientTestHarness close resources in AfterAll phase (#2646) Raymond Xu 2021-03-08 01:36:03 -08:00
  • 8b9dea4ad9 [HUDI-1673] Replace scala.Tule2 to Pair in FlinkHoodieBloomIndex (#2642) Shen Hong 2021-03-08 14:30:34 +08:00
  • 02073235c3 [HUDI-1662] Fix hive date type conversion for mor table (#2634) xiarixiaoyao 2021-03-08 12:16:13 +08:00
  • 5cf2f2618b [HUDI-1618] Fixing NPE with Parquet src in multi table delta streamer (#2577) Sivabalan Narayanan 2021-03-07 16:40:40 -05:00
  • 9437e0ddef [MINOR] Fix import in StreamerUtil.java (#2638) Raymond Xu 2021-03-07 12:37:15 -08:00
  • 11ad4ed26b [HUDI-1661] Exclude clustering commits from getExtraMetadataFromLatest API (#2632) satishkotha 2021-03-05 13:42:19 -08:00
  • f2159c4573 [HUDI-1660] Excluding compaction and clustering instants from inflight rollback (#2631) n3nash 2021-03-05 11:18:09 -08:00
  • bc883db5de [HUDI-1636] Support Builder Pattern To Build Table Properties For HoodieTableConfig (#2596) pengzhiwei 2021-03-05 14:10:27 +08:00
  • f53bca404f [HUDI-1655] Support custom date format and fix unsupported exception in DatePartitionPathSelector (#2621) Raymond Xu 2021-03-04 21:01:51 -08:00
  • 7cc75e0be2 [HUDI-1646] Provide mechanism to read uncommitted data through InputFormat (#2611) satishkotha 2021-03-04 17:43:31 -08:00
  • 89003bc780 [HUDI-1647] Supports snapshot read for Flink (#2613) Danny Chan 2021-03-05 08:49:32 +08:00
  • 899ae70fdb [HUDI-1587] Add latency and freshness support (#2541) Raymond Xu 2021-03-03 20:13:12 -08:00
  • f11a6c7b2d [HUDI-1553] Configuration and metrics for the TimelineService. (#2495) Prashant Wason 2021-03-02 21:58:41 -08:00
  • 4fa43359cb [MINOR] Fix default value for hoodie.deltastreamer.source.kafka.auto.reset.offsets (#2617) t0il3ts0ap 2021-03-03 07:19:18 +05:30
  • 0dde7f9185 [HUDI-1584] Modify maker file path, which should start with the target base path. (#2539) ZhangChaoMing 2021-03-02 17:52:21 +08:00
  • 73fa308ff0 [HUDI-1634] Re-bootstrap metadata table when un-synced instants have been archived. (#2595) Prashant Wason 2021-03-01 20:31:55 -08:00
  • 7a6b071647 [HUDI-1644] Do not delete older rollback instants as part of rollback. Archival can take care of removing old instants cleanly (#2610) satishkotha 2021-03-01 09:40:00 -08:00
  • 657e73f9b1 [HUDI-1540] Fixing commons codec dependency in bundle jars (#2562) Sivabalan Narayanan 2021-03-01 12:34:10 -05:00
  • 7a11de1276 [HUDI-1632] Supports merge on read write mode for Flink writer (#2593) Danny Chan 2021-03-01 12:29:41 +08:00
  • be257b58c6 [Hudi-1583]: Fix bug that Hudi will skip remaining log files if there is logFile with zero size in logFileList when merge on read. (#2584) Liulietong 2021-02-27 06:43:47 +08:00
  • 022df0d1b1 [HUDI-1611] Added a configuration to allow specific directories to be filtered out during Metadata Table bootstrap. (#2565) Prashant Wason 2021-02-25 16:52:28 -08:00
  • 9f5e8cc7c3 Fixing README for hudi test suite long running job (#2578) Sivabalan Narayanan 2021-02-25 19:50:18 -05:00
  • 8c2197ae5e [HUDI-1269] Make whether the failure of connect hive affects hudi ingest process configurable (#2443) liujinhui 2021-02-25 23:09:32 +08:00
  • 617cc24ad1 [HUDI-1367] Make deltaStreamer transition from dfsSouce to kafkasouce (#2227) liujinhui 2021-02-25 20:08:13 +08:00
  • 06dc7c7fd8 [HUDI-1638] Some improvements to BucketAssignFunction (#2600) Danny Chan 2021-02-25 14:33:21 +08:00
  • 97864a48c1 [HUDI-1637] Avoid to rename for bucket update when there is only one flush action during a checkpoint (#2599) Danny Chan 2021-02-25 10:21:27 +08:00
  • 77ba561a6b [HUDI-1347] Fix Hbase index to make rollback synchronous (via config) (#2188) hj2016 2021-02-24 09:56:58 +08:00
  • ab9933f206 [HUDI-1620] Add azure pipelines configs (#2582) Raymond Xu 2021-02-23 16:52:41 -08:00
  • 3b8d0f3b1f [MINOR] hive sync checks for table after creating db if auto create is true (#2591) Ankush Kanungo 2021-02-23 10:35:14 -08:00
  • d2f360f5dd [MINOR] Ensure directory exists before listing all marker files. (#2594) Prashant Wason 2021-02-23 08:05:59 -08:00
  • 2efd0760ac [HUDI-1477] Support copyOnWriteTable in java client (#2382) Shen Hong 2021-02-23 20:50:55 +08:00
  • 3ceb1b4c83 [HUDI-1624] The state based index should bootstrap from existing base files (#2581) Danny Chan 2021-02-23 13:37:44 +08:00
  • 43a0776c7c [HUDI-1586] [Common Core] [Flink Integration] Reduce the coupling of hadoop. (#2540) ZhangChaoMing 2021-02-21 11:54:04 +08:00
  • ffcfb58bac [HUDI-1486] Remove inline inflight rollback in hoodie writer (#2359) n3nash 2021-02-19 20:12:22 -08:00
  • c9fcf964b2 [HUDI-1315] Adding builder for HoodieTableMetaClient initialization (#2534) Sivabalan Narayanan 2021-02-19 20:54:26 -05:00
  • 0d91c451b0 [HUDI-1539] Fix bug in HoodieCombineRealtimeRecordReader with reading empty iterators (#2583) satishkotha 2021-02-19 15:45:43 -08:00
  • b0010bf3b4 [HUDI-1582] Throw an exception when syncHoodieTable() fails, with RuntimeException (#2536) Balajee Nagasubramaniam 2021-02-17 17:34:15 -08:00
  • 9431aabfab [HUDI-1381] Schedule compaction based on time elapsed (#2260) Karl_Wang 2021-02-17 23:44:53 +08:00
  • c4bbcb7f0e [HUDI-1621] Gets the parallelism from context when init StreamWriteOperatorCoordinator (#2579) lamber-ken 2021-02-17 20:04:38 +08:00
  • 37972071ff [HUDI-1109] Support Spark Structured Streaming read from Hudi table (#2485) pengzhiwei 2021-02-17 19:36:29 +08:00
  • 5d2491d10c [HUDI-1598] Write as minor batches during one checkpoint interval for the new writer (#2553) Danny Chan 2021-02-17 15:24:50 +08:00
  • 302bd29dab [MINOR] Add clustering to feature list (#2568) vinoyang 2021-02-13 23:39:14 +08:00
  • 527175ab0b [MINOR] Default to empty list for unset datadog tags property (#2574) Raymond Xu 2021-02-12 23:52:03 -08:00
  • d5f202821b Adding fixes to test suite framework. Adding clustering node and validate async operations node. (#2400) Sivabalan Narayanan 2021-02-12 12:29:21 -05:00
  • ff0e3f5669 [HUDI-1612] Fix write test flakiness in StreamWriteITCase (#2567) lamber-ken 2021-02-11 23:37:19 +08:00
  • 26da4f5462 [HUDI-1526] Translate the api partitionBy in spark datasource to hoodie.datasource.write.partitionpath.field (#2431) teeyog 2021-02-11 01:07:54 +08:00
  • a2f85d90de [MINOR] Fix the wrong comment for HoodieJavaWriteClientExample (#2559) vinoyang 2021-02-10 02:33:34 +08:00
  • 7a98b1c878 [HUDI-1603] fix DefaultHoodieRecordPayload serialization failure (#2556) Gary Li 2021-02-09 23:53:45 +08:00
  • c30481f4b0 [HUDI-1545] Add test cases for INSERT_OVERWRITE Operation (#2483) Sun Ke 2021-02-08 13:47:01 +08:00
  • 4c5b6923cc [HUDI-1557] Make Flink write pipeline write task scalable (#2506) Danny Chan 2021-02-06 22:03:52 +08:00
  • 291f92069e [MINOR] Fix wrong logic for checking state condition (#2524) ZhangChaoMing 2021-02-06 16:40:31 +08:00
  • b2c47a24be [HUDI-1589] Fix Rollback Metadata AVRO backwards incompatiblity (#2543) n3nash 2021-02-05 16:03:34 -08:00
  • b5d4a046bb [HUDI-1571] Adding commit_show_records_info to display record sizes for commit (#2514) Sivabalan Narayanan 2021-02-05 07:53:24 -05:00
  • b51b3a39a8 [HUDI-1420] HoodieTableMetaClient.getMarkerFolderPath works incorrectly on windows client with hdfs server for wrong file seperator (#2526) hiscat 2021-02-05 16:24:35 +08:00
  • 4a5683d54a [MINOR] Fixing the default value for source ordering field for payload config (#2516) Sivabalan Narayanan 2021-02-04 08:43:03 -05:00
  • 647e9faf25 [HUDI-1547] CI intermittent failure: TestJsonStringToHoodieRecordMapF… (#2521) wangxianghu 2021-02-04 11:20:01 +08:00
  • 17802569fd [HUDI-1538] Try to init class trying different signatures instead of checking its name (#2476) Volodymyr Burenin 2021-02-03 14:29:08 -06:00
  • eb91e5ba70 [HUDI-1523] Call mkdir(partition) only if not exists (#2501) Sivabalan Narayanan 2021-02-03 09:02:37 -05:00
  • d74d8e2084 [HUDI-1335] Introduce FlinkHoodieSimpleIndex to hudi-flink-client (#2271) wangxianghu 2021-02-03 08:59:49 +08:00
  • 50ff9ab2d2 [MINOR] Rename FileSystemViewHandler to RequestHandler and corrected the class comment (#2458) vinoyang 2021-02-03 01:15:53 +08:00
  • ec950b4cfe [MINOR] Fix method comment typo (#2518) jackiehff 2021-02-02 19:23:29 +08:00
  • 0d8a4d0a56 [HUDI-1550] Honor ordering field for MOR Spark datasource reader (#2497) pengzhiwei 2021-02-01 21:04:27 +08:00
  • f159c0c49a [HUDI-1519] Improve minKey/maxKey computation in HoodieHFileWriter (#2427) steven zhang 2021-02-01 20:51:57 +08:00
  • 5d053b495b [MINOR] Quickstart.generateUpdates method add check (#2505) jiangjiguang 2021-01-30 10:28:00 +08:00
  • 9cb6cb8189 [HUDI-1266] Add unit test for validating replacecommit rollback (#2418) satishkotha 2021-01-29 10:28:08 -08:00
  • 2d2d5c83b1 [HUDI-1555] Remove isEmpty to improve clustering execution performance (#2502) satishkotha 2021-01-29 10:27:09 -08:00
  • 23f2ef3efb [HUDI-623] Remove UpgradePayloadFromUberToApache (#2455) wangxianghu 2021-01-29 09:48:50 +08:00