vinoyang
|
52524b659d
|
[HUDI-2165] Support Transformer for HoodieFlinkStreamer (#3270)
|
2021-07-14 23:01:52 +08:00 |
|
喻兆靖
|
aff1a1ed29
|
[HUDI-2171] Add parallelism conf for bootstrap operator
|
2021-07-13 17:55:12 +08:00 |
|
swuferhong
|
9b01d2a045
|
[HUDI-2142] Support setting bucket assign parallelism for flink write task (#3239)
|
2021-07-10 15:43:36 +08:00 |
|
Danny Chan
|
942a024e74
|
[HUDI-2143] Tweak the default compaction target IO to 500GB when flink async compaction is off (#3238)
|
2021-07-10 15:40:30 +08:00 |
|
yuzhaojing
|
783c9cb369
|
[HUDI-2087] Support Append only in Flink stream (#3252)
Co-authored-by: 喻兆靖 <yuzhaojing@bilibili.com>
|
2021-07-10 14:49:35 +08:00 |
|
vinoyang
|
7c6eebf98c
|
[MINOR] Fix some wrong assert reasons (#3248)
|
2021-07-10 14:35:40 +08:00 |
|
wangxianghu
|
3b2a4f2b6b
|
[HUDI-2147] Remove unused class AvroConvertor in hudi-flink (#3243)
|
2021-07-10 10:16:33 +08:00 |
|
vinoth chandar
|
b4562e86e4
|
Revert "[HUDI-2087] Support Append only in Flink stream (#3174)" (#3251)
This reverts commit 371526789d.
|
2021-07-09 11:20:09 -07:00 |
|
yuzhaojing
|
371526789d
|
[HUDI-2087] Support Append only in Flink stream (#3174)
Co-authored-by: 喻兆靖 <yuzhaojing@bilibili.com>
|
2021-07-09 16:06:32 +08:00 |
|
wangxianghu
|
f2621da32f
|
[HUDI-2093] Fix empty avro schema path caused by duplicate parameters (#3177)
* [HUDI-2093] Fix empty avro schema path caused by duplicate parameters
* rename schema option key
* fix doc
* rename var name
|
2021-07-06 15:14:30 +08:00 |
|
Danny Chan
|
1d6978cde4
|
[HUDI-2135] Add compaction schedule option for flink (#3226)
|
2021-07-06 14:11:20 +08:00 |
|
swuferhong
|
bc313727e3
|
[HUDI-2106] Fix flink batch compaction bug when the user doesn't set compaction tasks (#3192)
|
2021-07-06 09:10:37 +08:00 |
|
Danny Chan
|
32bd8ce088
|
[HUDI-2132] Make coordinator events as POJO for efficient serialization (#3223)
|
2021-07-06 09:02:38 +08:00 |
|
Danny Chan
|
e6ee7bdb51
|
[HUDI-2129] StreamerUtil.medianInstantTime should return a valid date time string (#3221)
|
2021-07-05 20:56:24 +08:00 |
|
Danny Chan
|
98ec017bc8
|
[HUDI-2126] The coordinator sends events to write function when there is no data for the checkpoint (#3219)
|
2021-07-05 11:34:18 +08:00 |
|
Danny Chan
|
d424fe6072
|
[HUDI-2121] Add operator uid for flink stateful operators (#3212)
|
2021-07-02 19:44:32 +08:00 |
|
Danny Chan
|
7462fdefc3
|
[HUDI-2112] Support reading pure logs file group for flink batch reader after compaction (#3202)
|
2021-07-02 16:29:22 +08:00 |
|
pengzhiwei
|
b34d53fa9c
|
[HUDI-2088] Missing Partition Fields And PreCombineField In Hoodie Properties For Table Written By Flink (#3171)
|
2021-07-01 17:25:18 +08:00 |
|
vinoth chandar
|
d07def1290
|
[MINOR] Fix broken build due to FlinkOptions (#3198)
|
2021-06-30 20:34:58 -07:00 |
|
wenningd
|
d412fb2fe6
|
[HUDI-89] Add configOption & refactor all configs based on that (#2833)
Co-authored-by: Wenning Ding <wenningd@amazon.com>
|
2021-06-30 14:26:30 -07:00 |
|
yuzhaojing
|
07e93de8b4
|
[HUDI-2052] Support load logFile in BootstrapFunction (#3134)
Co-authored-by: 喻兆靖 <yuzhaojing@bilibili.com>
|
2021-06-30 20:37:00 +08:00 |
|
yuzhaojing
|
1cbf43b6e7
|
[HUDI-2103] Add rebalance before index bootstrap (#3185)
Co-authored-by: 喻兆靖 <yuzhaojing@bilibili.com>
|
2021-06-30 16:40:55 +08:00 |
|
wangxianghu
|
202887b8ca
|
[HUDI-2092] Fix NPE caused by FlinkStreamerConfig#writePartitionUrlEncode null value (#3176)
|
2021-06-30 09:21:06 +08:00 |
|
swuferhong
|
f665db071f
|
[HUDI-2085] Support specifying compaction parallelism and compaction target io for flink batch compaction (#3169)
|
2021-06-29 22:53:01 +08:00 |
|
swuferhong
|
5a7d1b3d6c
|
[HUDI-2097] Fix Flink unable to read commit metadata error (#3180)
|
2021-06-29 22:43:47 +08:00 |
|
Danny Chan
|
b8a8f572d6
|
[HUDI-2094] Supports hive style partitioning for flink writer (#3178)
|
2021-06-29 15:34:26 +08:00 |
|
yuzhaojing
|
37b7c65d8a
|
[HUDI-2084] Resend the uncommitted write metadata when start up (#3168)
Co-authored-by: 喻兆靖 <yuzhaojing@bilibili.com>
|
2021-06-29 08:53:52 +08:00 |
|
Vinay Patil
|
34fc8a8880
|
[HUDI-2067] Sync FlinkOptions config to FlinkStreamerConfig (#3151)
|
2021-06-28 19:26:08 +08:00 |
|
wangxianghu
|
9e61dad597
|
[MINOR] Drop duplicate keygenerator class configuration setting (#3167)
|
2021-06-28 17:11:32 +08:00 |
|
Danny Chan
|
d24341d10c
|
[HUDI-2074] Use while loop instead of recursive call in MergeOnReadInputFormat#MergeIterator to avoid StackOverflow (#3159)
|
2021-06-28 16:03:10 +08:00 |
|
wangxianghu
|
f73bedd374
|
[MINOR] Remove unused methods (#3152)
|
2021-06-26 13:19:26 +08:00 |
|
Danny Chan
|
e64fe55054
|
[HUDI-2068] Skip the assign state for SmallFileAssign when the state can not assign initially (#3148)
|
2021-06-25 08:57:56 +08:00 |
|
yuzhaojing
|
218f2a6df8
|
[HUDI-2062] Catch FileNotFoundException in WriteProfiles #getCommitMetadata Safely (#3138)
Co-authored-by: 喻兆靖 <yuzhaojing@bilibili.com>
|
2021-06-25 08:54:59 +08:00 |
|
yuzhaojing
|
380518e232
|
[HUDI-2038] Support rollback inflight compaction instances for CompactionPlanOperator (#3105)
Co-authored-by: 喻兆靖 <yuzhaojing@bilibili.com>
|
2021-06-23 20:58:52 +08:00 |
|
Danny Chan
|
2687eab8f0
|
[HUDI-2054] Remove the duplicate name for flink write pipeline (#3135)
|
2021-06-23 14:49:38 +08:00 |
|
yuzhaojing
|
5db37c255b
|
[HUDI-2047] Ignore FileNotFoundException in WriteProfiles #getWritePathsOfInstant (#3125)
Co-authored-by: 喻兆靖 <yuzhaojing@bilibili.com>
|
2021-06-22 14:18:46 +08:00 |
|
swuferhong
|
f8d9242372
|
[HUDI-2050] Support rollback inflight compaction instances for batch flink compactor (#3124)
|
2021-06-21 20:32:48 +08:00 |
|
Danny Chan
|
adf167991a
|
[HUDI-2049] StreamWriteFunction should wait for the next inflight instant time before flushing (#3123)
|
2021-06-21 20:15:27 +08:00 |
|
Danny Chan
|
cdb9b48170
|
[HUDI-2040] Make flink writer as exactly-once by default (#3106)
|
2021-06-18 13:55:23 +08:00 |
|
Danny Chan
|
aa6342c3c9
|
[HUDI-2036] Move the compaction plan scheduling out of flink writer coordinator (#3101)
Since HUDI-1955 was fixed, we can move the scheduling out of the
coordinator to make the coordinator more lightweight.
|
2021-06-18 09:35:09 +08:00 |
|
vinoyang
|
67c3124352
|
[HUDI-2032] Make keygen class and keygen type optional for FlinkStreamerConfig (#3104)
* Address the review suggestion
|
2021-06-17 21:22:13 +08:00 |
|
yuzhaojing
|
f97dd25d41
|
[HUDI-2019] Set up the file system view storage config for singleton embedded server write config every time (#3102)
Co-authored-by: 喻兆靖 <yuzhaojing@bilibili.com>
|
2021-06-17 20:28:03 +08:00 |
|
Danny Chan
|
6763b45dd4
|
[HUDI-2030] Add metadata cache to WriteProfile to reduce IO (#3090)
Keeps the same number of instant metadata caches and refreshes the cache on new
commits.
|
2021-06-17 19:10:34 +08:00 |
|
Danny Chan
|
0b57483a8e
|
[HUDI-2015] Fix flink operator uid to allow multiple pipelines in one job (#3091)
|
2021-06-17 09:08:19 +08:00 |
|
swuferhong
|
8b0a502c4f
|
[HUDI-2014] Support flink hive sync in batch mode (#3081)
|
2021-06-16 14:29:16 +08:00 |
|
Danny Chan
|
cb642ceb75
|
[HUDI-1999] Refresh the base file view cache for WriteProfile (#3067)
Refresh the view to discover new small files.
|
2021-06-15 08:18:38 -07:00 |
|
swuferhong
|
0c4f2fdc15
|
[HUDI-1984] Support independent flink hudi compaction function (#3046)
|
2021-06-13 15:04:46 +08:00 |
|
Danny Chan
|
125415a8b8
|
[HUDI-1994] Release the new records iterator for append handle #close (#3058)
|
2021-06-10 19:09:23 +08:00 |
|
yuzhaojing
|
728089a888
|
delete duplicate bootstrap function (#3052)
Co-authored-by: 喻兆靖 <yuzhaojing@bilibili.com>
|
2021-06-09 19:29:57 +08:00 |
|
Danny Chan
|
e8fcf04b57
|
[HUDI-1987] Fix non partition table hive meta sync for flink writer (#3049)
|
2021-06-09 14:20:04 +08:00 |
|