[HUDI-4250][HUDI-4202] Optimize performance of Column Stats Index reading in Data Skipping (#5746)
We provide an alternative way of fetching Column Stats Index within the reading process to avoid the penalty of a more heavy-weight execution scheduled through a Spark engine.
This commit is contained in:
@@ -319,7 +319,7 @@ public class ColumnStatsIndices {
|
||||
.map(colName -> new ColumnIndexID(colName).asBase64EncodedString()).collect(Collectors.toList());
|
||||
|
||||
HoodieData<HoodieRecord<HoodieMetadataPayload>> records =
|
||||
metadataTable.getRecordsByKeyPrefixes(encodedTargetColumnNames, HoodieTableMetadataUtil.PARTITION_NAME_COLUMN_STATS);
|
||||
metadataTable.getRecordsByKeyPrefixes(encodedTargetColumnNames, HoodieTableMetadataUtil.PARTITION_NAME_COLUMN_STATS, false);
|
||||
|
||||
org.apache.hudi.util.AvroToRowDataConverters.AvroToRowDataConverter converter =
|
||||
AvroToRowDataConverters.createRowConverter((RowType) METADATA_DATA_TYPE.getLogicalType());
|
||||
|
||||
Reference in New Issue
Block a user