[HUDI-3172] Refactor hudi existing modules to make more code reuse in V2 Implementation (#4514)
* Introduce the hudi-spark3-common and hudi-spark2-common modules to hold classes that are reused across Spark versions, and introduce hudi-spark3.1.x to support Spark 3.1.x.
* Introduce the hudi format under the hudi-spark2, hudi-spark3, and hudi-spark3.1.x modules, and change the hudi format in the original hudi-spark module to the hudi_v1 format.
* Manually tested with Spark SQL on Spark 3.1.2 and Spark 3.2.0.
* Added a README.md file under the hudi-spark-datasource module.
38
hudi-spark-datasource/README.md
Normal file
@@ -0,0 +1,38 @@
<!--
  * Licensed to the Apache Software Foundation (ASF) under one
  * or more contributor license agreements.  See the NOTICE file
  * distributed with this work for additional information
  * regarding copyright ownership.  The ASF licenses this file
  * to you under the Apache License, Version 2.0 (the
  * "License"); you may not use this file except in compliance
  * with the License.  You may obtain a copy of the License at
  *
  *      http://www.apache.org/licenses/LICENSE-2.0
  *
  * Unless required by applicable law or agreed to in writing, software
  * distributed under the License is distributed on an "AS IS" BASIS,
  * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  * See the License for the specific language governing permissions and
-->

# Description of the relationship between each module

The hudi-spark-datasource module contains the code that integrates Hudi with Spark. It is split into the following submodules:

* `hudi-spark`
* `hudi-spark2`
* `hudi-spark3`
* `hudi-spark3.1.x`
* `hudi-spark2-common`
* `hudi-spark3-common`
* `hudi-spark-common`

* `hudi-spark` contains the code shared by the Spark 2 and Spark 3 versions, along with the ANTLR4
  grammar file that supports Spark SQL on Spark 2.x.
* `hudi-spark2` contains the code that is compatible with Spark 2.x.
* `hudi-spark3` contains the code that is compatible with Spark 3.2.0 and above.
* `hudi-spark3.1.x` contains the code that is compatible with Spark 3.1.x and Spark 3.0.x.
* `hudi-spark2-common` contains the code reused across Spark 2.x versions. It currently has no classes
  because Hudi supports only Spark 2.4.4; it acts as a placeholder when packaging the hudi-spark-bundle module.
* `hudi-spark3-common` contains the code reused across Spark 3.x versions.
* `hudi-spark-common` contains the code reused across Spark 2.x and Spark 3.x versions.
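
The version-to-module relationship described in the list can be summarized with a small lookup. The following is only an illustrative sketch in Python: the function `modules_for_spark` is hypothetical and is not part of Hudi, and it does not model how the build actually selects modules. It simply encodes the compatibility ranges stated in this README.

```python
from typing import List


def modules_for_spark(version: str) -> List[str]:
    """Return the hudi-spark-datasource modules this README associates
    with a given Spark version string such as "3.1.2".

    Hypothetical helper for illustration only; not a Hudi API.
    """
    major, minor = (int(part) for part in version.split(".")[:2])

    # Modules the README describes as shared by Spark 2.x and Spark 3.x.
    shared = ["hudi-spark-common", "hudi-spark"]

    if major == 2:
        return shared + ["hudi-spark2-common", "hudi-spark2"]
    if major == 3 and minor in (0, 1):
        # Spark 3.0.x and 3.1.x are both served by hudi-spark3.1.x.
        return shared + ["hudi-spark3-common", "hudi-spark3.1.x"]
    if major == 3 and minor >= 2:
        # Spark 3.2.0 and above are served by hudi-spark3.
        return shared + ["hudi-spark3-common", "hudi-spark3"]

    raise ValueError(f"No module mapping described for Spark {version}")
```

For example, `modules_for_spark("3.0.1")` includes `hudi-spark3.1.x` rather than `hudi-spark3`, which reflects the slightly surprising naming: the `hudi-spark3` module targets the newest Spark 3 line, not every Spark 3 release.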