All Classes and Interfaces (ORC Core 2.2.0 API)

Class

Description

AcidStats

Statistics about the ACID operations in an ORC file.

BatchFilter

Defines a batch filter that can operate on a VectorizedRowBatch and filter rows by using the selected vector to determine the eligible rows.

BatchReader

The top level interface that the reader uses to read the columns from the ORC file.

BinaryColumnStatistics

Statistics for binary columns.

BloomFilter

BloomFilter is a probabilistic data structure for set membership checks.
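
As a rough illustration of the idea behind a Bloom filter, the sketch below keeps a bit set and sets a few hash-derived bits per value. It is not ORC's implementation: the class name SimpleBloomFilter, the FNV-1a hash, and the sizing parameters are all assumptions chosen for brevity. A lookup can return a false positive but never a false negative.

```java
import java.nio.charset.StandardCharsets;
import java.util.BitSet;

/** Hypothetical sketch of a Bloom filter; not the ORC BloomFilter class. */
final class SimpleBloomFilter {
    private final BitSet bits;
    private final int numBits;
    private final int numHashes;

    SimpleBloomFilter(int numBits, int numHashes) {
        if (numBits <= 0 || numHashes <= 0) {
            throw new IllegalArgumentException("numBits and numHashes must be positive");
        }
        this.bits = new BitSet(numBits);
        this.numBits = numBits;
        this.numHashes = numHashes;
    }

    /** Sets numHashes bit positions derived from the value. */
    void add(String value) {
        long h = hash64(value);
        int h1 = (int) h;
        int h2 = (int) (h >>> 32);
        for (int i = 0; i < numHashes; i++) {
            bits.set(Math.floorMod(h1 + i * h2, numBits));
        }
    }

    /** Returns false only if the value was definitely never added. */
    boolean mightContain(String value) {
        long h = hash64(value);
        int h1 = (int) h;
        int h2 = (int) (h >>> 32);
        for (int i = 0; i < numHashes; i++) {
            if (!bits.get(Math.floorMod(h1 + i * h2, numBits))) {
                return false;
            }
        }
        return true;
    }

    /** 64-bit FNV-1a over the UTF-8 bytes (an assumption; ORC uses Murmur3). */
    private static long hash64(String value) {
        long hash = 0xcbf29ce484222325L;
        for (byte b : value.getBytes(StandardCharsets.UTF_8)) {
            hash ^= (b & 0xff);
            hash *= 0x100000001b3L;
        }
        return hash;
    }
}
```

Hashing the UTF-8 bytes rather than the platform default encoding keeps the bit positions stable across JVMs, which is the concern the BloomFilterUtf8 entry describes.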

BloomFilter.BitSet

Bare metal bit set implementation.

BloomFilterIO

BloomFilterIO.Encoding

BloomFilterUtf8

This class represents the fix from ORC-101 where we fixed the bloom filter from using the JVM's default character set to always using UTF-8.

BooleanColumnStatistics

Statistics for boolean columns.

BooleanTreeWriter

BoundingBox

Bounding box for Geometry or Geography type in the representation of min/max value pairs of coordinates from each axis.

BrotliCodec

BufferChunk

The sections of a stripe that we have read.

BufferChunkList

Builds a list of buffer chunks.

ByteTreeWriter

CharTreeWriter

Under the covers, char is written to ORC the same way as string.

CollectionColumnStatistics

Statistics for all collections, such as Map and List.

ColumnStatistics

Statistics that are available for all types of columns.

ColumnStatisticsImpl

ColumnStatisticsImpl.BinaryStatisticsImpl

ColumnStatisticsImpl.StringStatisticsImpl

CompressionCodec

The API for compression codecs for ORC.

CompressionCodec.DataKind

CompressionCodec.Options

CompressionCodec.SpeedModifier

CompressionKind

An enumeration that lists the generic compression algorithms that can be applied to ORC files.

ConvertTreeReaderFactory

Convert ORC tree readers.

ConvertTreeReaderFactory.AnyIntegerFromAnyIntegerTreeReader

ConvertTreeReaderFactory.AnyIntegerFromDecimalTreeReader

ConvertTreeReaderFactory.AnyIntegerFromDoubleTreeReader

ConvertTreeReaderFactory.AnyIntegerFromStringGroupTreeReader

ConvertTreeReaderFactory.AnyIntegerFromTimestampTreeReader

ConvertTreeReaderFactory.ConvertTreeReader

Override methods like checkEncoding to pass through to the convert TreeReader.

ConvertTreeReaderFactory.DateFromStringGroupTreeReader

ConvertTreeReaderFactory.DateFromTimestampTreeReader

ConvertTreeReaderFactory.DecimalFromAnyIntegerTreeReader

ConvertTreeReaderFactory.DecimalFromDecimalTreeReader

ConvertTreeReaderFactory.DecimalFromDoubleTreeReader

ConvertTreeReaderFactory.DecimalFromStringGroupTreeReader

ConvertTreeReaderFactory.DecimalFromTimestampTreeReader

ConvertTreeReaderFactory.DoubleFromAnyIntegerTreeReader

ConvertTreeReaderFactory.DoubleFromDecimalTreeReader

ConvertTreeReaderFactory.DoubleFromStringGroupTreeReader

ConvertTreeReaderFactory.DoubleFromTimestampTreeReader

ConvertTreeReaderFactory.FloatFromDoubleTreeReader

ConvertTreeReaderFactory.StringGroupFromAnyIntegerTreeReader

ConvertTreeReaderFactory.StringGroupFromBinaryTreeReader

ConvertTreeReaderFactory.StringGroupFromBooleanTreeReader

ConvertTreeReaderFactory.StringGroupFromDateTreeReader

ConvertTreeReaderFactory.StringGroupFromDecimalTreeReader

ConvertTreeReaderFactory.StringGroupFromDoubleTreeReader

ConvertTreeReaderFactory.StringGroupFromStringGroupTreeReader

ConvertTreeReaderFactory.StringGroupFromTimestampTreeReader

ConvertTreeReaderFactory.TimestampFromAnyIntegerTreeReader

ConvertTreeReaderFactory.TimestampFromDateTreeReader

ConvertTreeReaderFactory.TimestampFromDecimalTreeReader

ConvertTreeReaderFactory.TimestampFromDoubleTreeReader

ConvertTreeReaderFactory.TimestampFromStringGroupTreeReader

CryptoUtils

This class has routines to work with encryption within ORC files.

CryptoUtils.HadoopKeyProviderFactory

CuckooSetBytes

A high-performance set implementation used to support fast set membership testing, using Cuckoo hashing.

DataMask

The API for masking data during column encryption for ORC.

DataMask.Factory

To create a DataMask, users should go through this API.

DataMask.MaskOverrides

An interface to provide override data masks for sub-columns.

DataMask.Provider

Providers can provide one or more kinds of data masks.

DataMask.Standard

The standard DataMasks can be created using this shortcut.

DataMaskDescription

Information about the DataMask used to mask the unencrypted data.

DataReader

An abstract data reader that IO formats can use to read bytes from underlying storage.

DataReaderProperties

DataReaderProperties.Builder

DateColumnStatistics

Statistics for DATE columns.

DateTreeWriter

DateUtils

Conversion utilities from the hybrid Julian/Gregorian calendar to/from the proleptic Gregorian.

Decimal64TreeWriter

Writer for short decimals in ORCv2.

DecimalColumnStatistics

Statistics for decimal columns.

DecimalIdentity

An identity data mask for decimal types.

DecimalTreeWriter

Dictionary

Interface to define the dictionary used for encoding values in columns of specific types like string, char, varchar, etc.

Dictionary.IMPL

Dictionary.Visitor

The interface for visitors.

Dictionary.VisitorContext

The information about each node.

DictionaryUtils

DirectDecompressionCodec

DoubleColumnStatistics

Statistics for float and double columns.

DoubleIdentity

An identity data mask for floating point types.

DoubleTreeWriter

DynamicByteArray

A class that is a growable array of bytes.

DynamicIntArray

Dynamic int array that uses primitive types and chunks to avoid copying a large number of integers when it resizes.
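
The chunking idea can be sketched as follows. This is a minimal illustration under assumptions, not ORC's DynamicIntArray: the class name ChunkedIntArray and its methods are hypothetical. Values live in fixed-size int[] chunks, so growing the array only copies the small table of chunk references and never the stored integers.

```java
import java.util.Arrays;

/** Hypothetical sketch of a chunked growable int array; not ORC's class. */
final class ChunkedIntArray {
    private final int chunkSize;
    private int[][] chunks = new int[4][]; // table of chunk references
    private int length = 0;

    ChunkedIntArray(int chunkSize) {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        this.chunkSize = chunkSize;
    }

    /** Appends a value, allocating a new chunk when the last one is full. */
    void add(int value) {
        int chunkIndex = length / chunkSize;
        if (chunkIndex >= chunks.length) {
            // Only the reference table is copied; existing chunks stay in place.
            chunks = Arrays.copyOf(chunks, chunks.length * 2);
        }
        if (chunks[chunkIndex] == null) {
            chunks[chunkIndex] = new int[chunkSize];
        }
        chunks[chunkIndex][length % chunkSize] = value;
        length++;
    }

    /** Returns the value at the given index. */
    int get(int index) {
        if (index < 0 || index >= length) {
            throw new IndexOutOfBoundsException("index " + index);
        }
        return chunks[index / chunkSize][index % chunkSize];
    }

    int size() {
        return length;
    }
}
```

Compared with a single int[] that doubles on overflow, this layout trades one extra indirection per access for cheaper growth.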

EncryptionKey

Information about a key used for column encryption in an ORC file.

EncryptionTreeWriter

TreeWriter that handles column encryption.

EncryptionVariant

Information about a column encryption variant.

FileFormatException

Thrown when an invalid file format is encountered.

FileMetadata

Deprecated.

Use OrcTail instead.

FilterFactory

FilterFactory.UnSupportedSArgException

FloatTreeWriter

GeospatialColumnStatistics

GeospatialTreeWriter

GeospatialTypes

A list of geospatial types from all instances in the Geometry or Geography column, or an empty list if they are not known.

HadoopShimsFactory

The factory for getting the proper version of the Hadoop shims.

HybridChronology

The Julian-Gregorian hybrid calendar system.

HybridDate

A date in the British Cutover calendar system.

InMemoryKeystore

This is an in-memory implementation of KeyProvider.

InStream

InStream.CompressedStream

InStream.EncryptedStream

Implements a stream over an encrypted, but uncompressed stream.

InStream.StreamOptions

InStream.UncompressedStream

Implements a stream over an uncompressed stream.

IntegerColumnStatistics

Statistics for all of the integer columns, such as byte, short, int, and long.

IntegerReader

Interface for reading integers.

IntegerTreeWriter

IntegerWriter

Interface for writing integers.

IOUtils

This is copied from commons-io project to cut the dependency from old Hadoop.

ListIdentity

A data mask for list types that applies the given masks to its children, but doesn't mask at this level.

ListTreeWriter

LongIdentity

An identity data mask for integer types.

MapIdentity

A data mask for map types that applies the given masks to its children, but doesn't mask at this level.

MapTreeWriter

MaskDescriptionImpl

MaskFactory

A mask factory framework that automatically builds a recursive mask.

MaskProvider

The Provider for all of the built-in data masks.

MemoryManager

Deprecated.

MemoryManager

A memory manager that keeps a global context of how many ORC writers there are and manages the memory between them.

MemoryManager.Callback

MemoryManagerImpl

Implements a memory manager that keeps a global context of how many ORC writers there are and manages the memory between them.

Murmur3

Murmur3 is the successor to Murmur2, a family of fast non-cryptographic hash algorithms.

NullifyMask

Masking routine that converts every value to NULL.

OrcAcidUtils

OrcCodecPool

A clone of the Hadoop codec pool for ORC, because it has its own codecs.

OrcConf

Defines the configuration properties that ORC understands.

OrcFile

Contains factory methods to read or write ORC files.

OrcFile.BloomFilterVersion

OrcFile.CompressionStrategy

OrcFile.EncodingStrategy

OrcFile.ReaderOptions

OrcFile.Version

Create a version number for the ORC file format, so that we can add non-forward compatible changes in the future.

OrcFile.WriterCallback

OrcFile.WriterContext

OrcFile.WriterImplementation

OrcFile.WriterOptions

Options for creating ORC file writers.

OrcFile.WriterVersion

Records the version of the writer in terms of which bugs have been fixed.

OrcFile.ZstdCompressOptions

OrcFilterContext

This defines the input for any filter operation.

OrcFilterContextImpl

This defines the input for any filter operation.

OutStream

The output stream for writing to ORC files.

ParserUtils

ParserUtils.StringPosition

ParserUtils.TypeFinder

ParserUtils.TypeVisitor

PhysicalFsWriter

PhysicalFsWriter.VariantTracker

Record the information about each column encryption variant.

PhysicalWriter

This interface separates the physical layout of ORC files from the higher level details.

PhysicalWriter.OutputReceiver

The target of an output stream.

PluginFilterService

Service to determine Plugin filters to be used during read.

PositionedOutputStream

PositionProvider

An interface used for seeking to a row index.

PositionRecorder

An interface for recording positions in a stream.

PrimitiveBatchReader

Reader

The interface for reading ORC files.

Reader.Options

Options for creating a RecordReader.

ReaderEncryption

ReaderEncryptionKey

This tracks the keys for reading encrypted columns.

ReaderEncryptionKey.State

Store the state of whether we've tried to decrypt a local key using this key or not.

ReaderEncryptionVariant

Information about an encrypted column.

ReaderImpl

ReaderImpl.StripeInformationImpl

RecordReader

A row-by-row iterator for ORC files.

RecordReaderImpl

RecordReaderImpl.PositionProviderImpl

RecordReaderImpl.SargApplier

RecordReaderImpl.ZeroPositionProvider

RecordReaderUtils

Stateless methods shared between RecordReaderImpl and EncodedReaderImpl.

RecordReaderUtils.ByteBufferAllocatorPool

RedactMaskFactory

Masking strategy that hides most string and numeric values based on unicode character categories.

RunLengthByteReader

A reader that reads a sequence of bytes.

RunLengthByteWriter

A writer that writes a sequence of bytes.

RunLengthIntegerReader

A reader that reads a sequence of integers.

RunLengthIntegerReaderV2

A reader that reads a sequence of lightweight compressed integers.

RunLengthIntegerWriter

A writer that writes a sequence of integers.

RunLengthIntegerWriterV2

A writer that performs lightweight compression over a sequence of integers.

RunLengthIntegerWriterV2.EncodingType

SchemaEvolution

Infer and track the evolution between the schema as stored in the file and the schema that has been requested by the reader.

SchemaEvolution.IllegalEvolutionException

Selected

Wrapper class for the selected vector that centralizes the convenience functions.

SerializationUtils

SerializationUtils.FixedBitSizes

SHA256MaskFactory

Masking strategy that masks String, Varchar, Char and Binary types as SHA 256 hash.

SnappyCodec

StreamName

The name of a stream within a stripe.

StreamName.Area

StreamOptions

The compression and encryption options for writing a stream.

StreamWrapperFileSystem

This class provides an adaptor so that tools that want to read an ORC file from an FSDataInputStream can do so.

StringBaseTreeWriter

StringColumnStatistics

Statistics for string columns.

StringHashTableDictionary

Using HashTable to represent a dictionary.

StringRedBlackTree

A red-black tree that stores strings.

StringTreeWriter

StripeInformation

Information about the stripes in an ORC file that is provided by the Reader.

StripePlanner

This class handles parsing the stripe information and handling the necessary filtering and selection.

StripePlanner.StreamInformation

StripeStatistics

The statistics for a stripe.

StripeStatisticsImpl

StructBatchReader

Handles the Struct rootType for batch handling.

StructIdentity

A data mask for struct types that applies the given masks to its children, but doesn't mask at this level.

StructTreeWriter

TimestampColumnStatistics

Statistics for Timestamp columns.

TimestampTreeWriter

TreeReaderFactory

Factory for creating ORC tree readers.

TreeReaderFactory.BinaryTreeReader

TreeReaderFactory.BooleanTreeReader

TreeReaderFactory.BytesColumnVectorUtil

TreeReaderFactory.ByteTreeReader

TreeReaderFactory.CharTreeReader

TreeReaderFactory.Context

TreeReaderFactory.DateTreeReader

TreeReaderFactory.Decimal64TreeReader

TreeReaderFactory.DecimalTreeReader

TreeReaderFactory.DoubleTreeReader

TreeReaderFactory.FloatTreeReader

TreeReaderFactory.GeospatialTreeReader

TreeReaderFactory.IntTreeReader

TreeReaderFactory.ListTreeReader

TreeReaderFactory.LongTreeReader

TreeReaderFactory.MapTreeReader

TreeReaderFactory.NullTreeReader

TreeReaderFactory.ReaderContext

TreeReaderFactory.ShortTreeReader

TreeReaderFactory.StringDictionaryTreeReader

A reader for string columns that are dictionary encoded in the current stripe.

TreeReaderFactory.StringDirectTreeReader

A reader for string columns that are direct encoded in the current stripe.

TreeReaderFactory.StringTreeReader

A tree reader that will read string columns.

TreeReaderFactory.StructTreeReader

TreeReaderFactory.TimestampTreeReader

TreeReaderFactory.TreeReader

TreeReaderFactory.UnionTreeReader

TreeReaderFactory.VarcharTreeReader

TreeWriter

The interface for the writers of each specific type.

TreeWriter.Factory

TreeWriterBase

The parent class of all of the writers for each column.

TypeDescription

This is the description of the types in an ORC file.

TypeDescription.Category

TypeDescription.EdgeInterpolationAlgorithm

TypeDescription.RowBatchVersion

Specify the version of the VectorizedRowBatch that the user desires.

TypeDescriptionPrettyPrint

A pretty printer for TypeDescription.

TypeReader

TypeReader.ReaderCategory

TypeReader.ReadPhase

TypeUtils

UnionIdentity

A data mask for union types that applies the given masks to its children, but doesn't mask at this level.

UnionTreeWriter

UnknownFormatException

Deprecated.

This will be removed in a future release.

Utf8Utils

VarcharTreeWriter

Under the covers, varchar is written to ORC the same way as string.

VectorFilter

A filter that operates on the supplied VectorizedRowBatch and updates the selections.

VisitorContextImpl

Base implementation for Dictionary.VisitorContext used to traverse all nodes in a dictionary.

Writer

The interface for writing ORC files.

WriterContext

WriterEncryptionKey

WriterEncryptionVariant

WriterImpl

An ORC file writer.

WriterImplV2

An ORCv2 file writer.

WriterInternal

The ORC internal API to the writer.

ZlibCodec

ZstdCodec