Package io.deephaven.parquet.base
Interface ColumnChunkReader
- All Known Implementing Classes:
ColumnChunkReaderImpl
public interface ColumnChunkReader
-
Nested Class Summary
Nested Classes
Modifier and Type | Interface | Description
static interface
static final class
-
Field Summary
Fields
Modifier and Type | Field | Description
static final org.apache.parquet.column.Dictionary | NULL_DICTIONARY
-
Method Summary
-
Field Details
-
NULL_DICTIONARY
static final org.apache.parquet.column.Dictionary NULL_DICTIONARY
-
-
Method Details
-
numRows
long numRows()
- Returns:
- The number of rows in this ColumnChunk, or -1 if it's unknown.
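Because numRows() uses -1 as an "unknown" sentinel, callers may want to check for it before using the value in arithmetic. The following is a minimal sketch only: RowCounted and knownRowCount are hypothetical names introduced here so the example compiles on its own, and they are not part of the Deephaven API.

```java
import java.util.OptionalLong;

final class NumRowsExample {
    /** Hypothetical stand-in exposing only numRows(); not the real ColumnChunkReader. */
    public interface RowCounted {
        long numRows();
    }

    /** Maps the -1 "unknown" sentinel (or any negative value) to an empty OptionalLong. */
    public static OptionalLong knownRowCount(RowCounted reader) {
        long rows = reader.numRows();
        return rows < 0 ? OptionalLong.empty() : OptionalLong.of(rows);
    }

    public static void main(String[] args) {
        RowCounted known = () -> 1_000L;   // a chunk whose row count is known
        RowCounted unknown = () -> -1L;    // a chunk reporting the sentinel
        System.out.println(knownRowCount(known).isPresent());
        System.out.println(knownRowCount(unknown).isPresent());
    }
}
```

Wrapping the sentinel in an OptionalLong is one design choice among several; a plain `rows >= 0` check at the call site works equally well.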
-
numValues
long numValues()
- Returns:
- The value stored under the corresponding ColumnMetaData.num_values field
-
getMaxRl
int getMaxRl()
- Returns:
- The number of nested repeated fields this column is part of (its maximum repetition level). 0 means this is a simple (non-repeating) field; 1 means this is a flat array.
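In the Parquet format, a column's maximum repetition level is the count of REPEATED fields on the path from the schema root to the leaf. The sketch below illustrates that counting rule with a hypothetical Repetition enum and helper. It does not use the real Parquet schema classes, and whether getMaxRl() is computed this way internally is an assumption.

```java
import java.util.List;

final class MaxRlExample {
    /** Hypothetical stand-in for the repetition type of one schema field. */
    public enum Repetition { REQUIRED, OPTIONAL, REPEATED }

    /** Counts the REPEATED fields along a root-to-leaf schema path. */
    public static int maxRepetitionLevel(List<Repetition> path) {
        int level = 0;
        for (Repetition r : path) {
            if (r == Repetition.REPEATED) {
                level++;
            }
        }
        return level;
    }

    public static void main(String[] args) {
        // A simple optional column: no repeated ancestors.
        System.out.println(maxRepetitionLevel(List.of(Repetition.OPTIONAL)));
        // A three-level LIST encoding: optional group, repeated group, optional element.
        System.out.println(maxRepetitionLevel(
                List.of(Repetition.OPTIONAL, Repetition.REPEATED, Repetition.OPTIONAL)));
    }
}
```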
-
getOffsetIndex
@Nullable org.apache.parquet.internal.column.columnindex.OffsetIndex getOffsetIndex()
- Returns:
- The offset index for this column chunk, or null if it is not found in the metadata.
-
getPageIterator
- Returns:
- An iterator over individual parquet pages
- Throws:
IOException
-
getPageAccessor
ColumnChunkReader.ColumnPageDirectAccessor getPageAccessor()
- Returns:
- An accessor for individual parquet pages
-
usesDictionaryOnEveryPage
boolean usesDictionaryOnEveryPage()
- Returns:
- Whether this column chunk uses a dictionary-based encoding on every page
-
getDictionarySupplier
Supplier<org.apache.parquet.column.Dictionary> getDictionarySupplier()
- Returns:
- Supplier for a Parquet dictionary for this column chunk
- ApiNote:
- The result will never return null. It will instead supply NULL_DICTIONARY.
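The contract above follows the null-object pattern: callers compare the supplied value against a shared sentinel instead of checking for null. The sketch below shows the pattern in isolation. Dict, neverNull, and hasDictionary are hypothetical names, and Dict is only a stand-in for org.apache.parquet.column.Dictionary so that the example compiles without the Parquet library.

```java
import java.util.function.Supplier;

final class NullDictionaryExample {
    /** Hypothetical stand-in for org.apache.parquet.column.Dictionary. */
    public interface Dict {
        int size();
    }

    /** Shared sentinel meaning "no dictionary"; compared by identity. */
    public static final Dict NULL_DICTIONARY = () -> 0;

    /** Wraps a loader that may return null so that callers never observe null. */
    public static Supplier<Dict> neverNull(Supplier<Dict> loader) {
        return () -> {
            Dict loaded = loader.get();
            return loaded != null ? loaded : NULL_DICTIONARY;
        };
    }

    /** Returns true if the supplier yields a real dictionary rather than the sentinel. */
    public static boolean hasDictionary(Supplier<Dict> supplier) {
        return supplier.get() != NULL_DICTIONARY;
    }

    public static void main(String[] args) {
        Dict threeEntries = () -> 3;
        System.out.println(hasDictionary(neverNull(() -> threeEntries)));
        System.out.println(hasDictionary(neverNull(() -> null)));
    }
}
```

With the real interface, the equivalent check would presumably compare the supplied dictionary against ColumnChunkReader.NULL_DICTIONARY by identity; that usage is inferred from the ApiNote rather than confirmed by it.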
-
getType
org.apache.parquet.schema.PrimitiveType getType()
-
getVersion
- Returns:
- The "version" string from Deephaven-specific Parquet metadata, or null if it's not present.
-