add hoodie delta streamer mock source example and dfs source and kafka source examples Signed-off-by: dengziming <dengziming1993@gmail.com> add defaultSparkConf utils method change version of hudi-examples to 0.5.2-SNAPSHOT change the artifcatId of hudi-spark and hudi-utilities alter some code to adapt kafka2.0 Update scritps Add license
36 lines
2.0 KiB
Markdown
36 lines
2.0 KiB
Markdown
<!--
|
|
Licensed to the Apache Software Foundation (ASF) under one or more
|
|
contributor license agreements. See the NOTICE file distributed with
|
|
this work for additional information regarding copyright ownership.
|
|
The ASF licenses this file to You under the Apache License, Version 2.0
|
|
(the "License"); you may not use this file except in compliance with
|
|
the License. You may obtain a copy of the License at
|
|
|
|
http://www.apache.org/licenses/LICENSE-2.0
|
|
|
|
Unless required by applicable law or agreed to in writing, software
|
|
distributed under the License is distributed on an "AS IS" BASIS,
|
|
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
|
|
See the License for the specific language governing permissions and
|
|
limitations under the License.
|
|
-->
|
|
|
|
This directory contains examples code that uses hudi.
|
|
|
|
To run the demo:
|
|
|
|
1. Configure your `SPARK_MASTER` env variable, yarn-cluster mode by default.
|
|
2. For hudi write client demo and hudi data source demo, just use spark-submit as common spark app
|
|
3. For hudi delta streamer demo of custom source, run `bin/custom-delta-streamer-example.sh`
|
|
4. For hudi delta streamer demo of dfs source:
|
|
4.1 Prepare dfs data, we have provided `src/main/resources/delta-streamer-config/dfs/source-file.json` for test
|
|
4.2 Run `bin/dfs-delta-streamer-example.sh`
|
|
5. For hudi delta streamer demo of dfs source:
|
|
5.1 Start Kafka server
|
|
5.2 Configure your Kafka properties, we have provided `src/main/resources/delta-streamer-config/kafka/kafka-source.properties` for test
|
|
5.3 Run `bin/kafka-delta-streamer-example.sh`
|
|
5.4 continuously write source data to the Kafka topic your configured with `hoodie.deltastreamer.source.kafka.topic` in `kafka-source.properties`
|
|
6. Some notes delta streamer demo:
|
|
6.1 The configuration files we provided is just the simplest demo, you can change it according to your specific needs.
|
|
6.2 You could also use Intellij to run the example directly by configuring parameters as "Program arguments"
|