diff --git a/README.md b/README.md index 427d8595f..de4c71000 100644 --- a/README.md +++ b/README.md @@ -34,6 +34,7 @@ Hudi manages the storage of large analytical datasets on DFS (Cloud stores, HDFS * Manages file sizes, layout using statistics * Async compaction of row & columnar data * Timeline metadata to track lineage +* Optimize data lake layout with clustering Hudi supports three types of queries: * **Snapshot Query** - Provides snapshot queries on real-time data, using a combination of columnar & row-based storage (e.g [Parquet](https://parquet.apache.org/) + [Avro](https://avro.apache.org/docs/current/mr.html)).