ORC 1.5.1 Released

The ORC team is excited to announce the release of ORC v1.5.1.

The new features of ORC 1.5:

  • ORC-179 Add ORC C++ Writer
  • ORC-91 Support for variable length blocks in HDFS.
  • ORC-199 Implement a CSV to ORC converter
  • ORC-344 Support for using Decimal64ColumnVector
  • ORC-345 Adding Decimal64StatisticsImpl
  • ORC-331 Support for building C++ under MSVC.
  • ORC-234 Support for older versions of Hadoop (>= 2.2.x)
  • ORC-305 Added statistics for size on disk

Known issues:

ORC 1.5.0 Released

The ORC team is excited to announce the release of ORC v1.5.0.

The new features of ORC 1.5:

  • ORC-179 Add ORC C++ Writer
  • ORC-91 Support for variable length blocks in HDFS.
  • ORC-199 Implement a CSV to ORC converter
  • ORC-344 Support for using Decimal64ColumnVector
  • ORC-345 Adding Decimal64StatisticsImpl
  • ORC-331 Support for building C++ under MSVC.
  • ORC-234 Support for older versions of Hadoop (>= 2.2.x)
  • ORC-305 Added statistics for size on disk

Known issues:

  • ORC-367 Boolean columns are read incorrectly when using seek.

ORC 1.4.4 Released

The ORC team is excited to announce the release of ORC v1.4.4.

The new features of ORC 1.4:

  • ORC-72 Add benchmark code for file formats.
  • ORC-87 Fix timestamp statistics in C++.
  • ORC-150 Add tool to convert from JSON.
  • ORC-151 Reduce the size of tools.jar.
  • ORC-174 Create a nohive variant of the jars.

Known issues:

ORC 1.4.3 Released

The ORC team is excited to announce the release of ORC v1.4.3.

The new features of ORC 1.4:

  • ORC-72 Add benchmark code for file formats.
  • ORC-87 Fix timestamp statistics in C++.
  • ORC-150 Add tool to convert from JSON.
  • ORC-151 Reduce the size of tools.jar.
  • ORC-174 Create a nohive variant of the jars.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

ORC 1.4.2 Released

The ORC team is excited to announce the release of ORC v1.4.2.

The new features of ORC 1.4:

  • ORC-72 Add benchmark code for file formats.
  • ORC-87 Fix timestamp statistics in C++.
  • ORC-150 Add tool to convert from JSON.
  • ORC-151 Reduce the size of tools.jar.
  • ORC-174 Create a nohive variant of the jars.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.4.1 Released

The ORC team is excited to announce the release of ORC v1.4.1.

The new features of ORC 1.4:

  • ORC-72 Add benchmark code for file formats.
  • ORC-87 Fix timestamp statistics in C++.
  • ORC-150 Add tool to convert from JSON.
  • ORC-151 Reduce the size of tools.jar.
  • ORC-174 Create a nohive variant of the jars.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.3.4 Released

The ORC team is excited to announce the release of ORC v1.3.4.

The new features of ORC 1.3:

  • ORC-58 Split C++ Reader into Reader and RowReader
  • ORC-120 Add backwards compatibility mode for schema evolution.
  • ORC-124 Fast decimal improvements
  • ORC-128 Add ability to get statistics from writer

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.4.0 Released

The ORC team is excited to announce the release of ORC v1.4.0.

The new features of ORC 1.4:

  • ORC-72 Add benchmark code for file formats.
  • ORC-87 Fix timestamp statistics in C++.
  • ORC-150 Add tool to convert from JSON.
  • ORC-151 Reduce the size of tools.jar.
  • ORC-174 Create a nohive variant of the jars.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.3.3 Released

The ORC team is excited to announce the release of ORC v1.3.3.

The new features of ORC 1.3:

  • ORC-58 Split C++ Reader into Reader and RowReader
  • ORC-120 Add backwards compatibility mode for schema evolution.
  • ORC-124 Fast decimal improvements
  • ORC-128 Add ability to get statistics from writer

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.3.2 Released

The ORC team is excited to announce the release of ORC v1.3.2.

The new features of ORC 1.3:

  • ORC-58 Split C++ Reader into Reader and RowReader
  • ORC-120 Add backwards compatibility mode for schema evolution.
  • ORC-124 Fast decimal improvements
  • ORC-128 Add ability to get statistics from writer

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.3.1 Released

The ORC team is excited to announce the release of ORC v1.3.1.

The new features of ORC 1.3:

  • ORC-58 Split C++ Reader into Reader and RowReader
  • ORC-120 Add backwards compatibility mode for schema evolution.
  • ORC-124 Fast decimal improvements
  • ORC-128 Add ability to get statistics from writer

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-135 Predicate push down is incorrect on timestamps when moved between timezones

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.3.0 Released

The ORC team is excited to announce the release of ORC v1.3.0.

The new features of ORC 1.3:

  • ORC-58 Split C++ Reader into Reader and RowReader
  • ORC-120 Add backwards compatibility mode for schema evolution.
  • ORC-124 Fast decimal improvements
  • ORC-128 Add ability to get statistics from writer

Known issues:

  • ORC-135 Predicate push down is incorrect on timestamps when moved between timezones

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.2.3 Released

The ORC team is excited to announce the release of ORC v1.2.3. This release fixes some bugs in the Java schema evolution code.

The new features of ORC 1.2:

  • ORC-54 Evolve schemas based on field name rather than index
  • ORC-84 Create a separate java tool module.
  • ORC-77 and ORC-81 Implement LZO and LZ4 compression codecs.
  • ORC-92 Add support for nested column id selection in C++
  • ORC-69 Add batch option support in orc-scan tools.

Important fixes:

  • HIVE-14214 ORC schema evolution and predicate push down do not work together.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-135 Predicate push down is incorrect on timestamps when moved between timezones

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.2.2 Released

The ORC team is excited to announce the release of ORC v1.2.2.

The new features of ORC 1.2:

  • ORC-54 Evolve schemas based on field name rather than index
  • ORC-84 Create a separate java tool module.
  • ORC-77 and ORC-81 Implement LZO and LZ4 compression codecs.
  • ORC-92 Add support for nested column id selection in C++
  • ORC-69 Add batch option support in orc-scan tools.

Important fixes:

  • HIVE-14214 ORC schema evolution and predicate push down do not work together.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-135 Predicate push down is incorrect on timestamps when moved between timezones

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.2.1 Released

The ORC team is excited to announce the release of ORC v1.2.1.

The new features of ORC 1.2:

  • ORC-54 Evolve schemas based on field name rather than index
  • ORC-84 Create a separate java tool module.
  • ORC-77 and ORC-81 Implement LZO and LZ4 compression codecs.
  • ORC-92 Add support for nested column id selection in C++
  • ORC-69 Add batch option support in orc-scan tools.

Important fixes:

  • HIVE-14214 ORC schema evolution and predicate push down do not work together.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-135 Predicate push down is incorrect on timestamps when moved between timezones

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.2.0 Released

The ORC team is excited to announce the release of ORC v1.2.0.

The new features of ORC 1.2:

  • ORC-54 Evolve schemas based on field name rather than index
  • ORC-84 Create a separate java tool module.
  • ORC-77 and ORC-81 Implement LZO and LZ4 compression codecs.
  • ORC-92 Add support for nested column id selection in C++
  • ORC-69 Add batch option support in orc-scan tools.

Important fixes:

  • HIVE-14214 ORC schema evolution and predicate push down do not work together.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-101 Bloom filters for string and decimal use inconsistent encoding

  • ORC-135 Predicate push down is incorrect on timestamps when moved between timezones

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.1.2 Released

The ORC team is excited to announce the release of ORC v1.1.2. This release contains the Java reader and writer and the native C++ ORC reader and tools.

The major new features in ORC 1.1 are:

  • ORC-1 Copy the Java ORC code from Hive.
  • ORC-10 Fix the C++ reader to correctly read timestamps from timezones with different daylight savings rules.
  • ORC-52 Add mapred and mapreduce connectors.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • HIVE-14214 Schema evolution and predicate pushdown don’t work together.

  • ORC-101 Bloom filters for string and decimal use inconsistent encoding

  • ORC-135 Predicate push down is incorrect on timestamps when moved between timezones

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.1.1 Released

The ORC team is excited to announce the release of ORC v1.1.1. This release contains the Java reader and writer and the native C++ ORC reader and tools.

The major new features in ORC 1.1 are:

  • ORC-1 Copy the Java ORC code from Hive.
  • ORC-10 Fix the C++ reader to correctly read timestamps from timezones with different daylight savings rules.
  • ORC-52 Add mapred and mapreduce connectors.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • HIVE-14214 Schema evolution and predicate pushdown don’t work together.

  • ORC-101 Bloom filters for string and decimal use inconsistent encoding

  • ORC-135 Predicate push down is incorrect on timestamps when moved between timezones

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.1.0 Released

The ORC team is excited to announce the release of ORC v1.1.0. This release contains the Java reader and writer and the native C++ ORC reader and tools.

Release Artifacts:

The major new features in ORC 1.1 are:

  • ORC-1 Copy the Java ORC code from Hive.
  • ORC-10 Fix the C++ reader to correctly read timestamps from timezones with different daylight savings rules.
  • ORC-52 Add mapred and mapreduce connectors.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • HIVE-14214 Schema evolution and predicate pushdown don’t work together.

  • ORC-101 Bloom filters for string and decimal use inconsistent encoding

  • ORC-135 Predicate push down is incorrect on timestamps when moved between timezones

  • ORC-285 Empty vector batches of floats or doubles cause EOFException

ORC 1.0.0 Released

The ORC team is excited to announce the release of ORC v1.0.0. This release contains the native C++ ORC reader and some tools.

The major features:

  • Portable pure C++ ORC reader
  • The C++ reader is known to work on:
    • CentOS and RHEL 5, 6, and 7
    • Debian 6 and 7
    • Ubuntu 12 and 14
    • Mac OS 10.10 and 10.11
  • A file-contents command that prints the contents of the file as json records.
  • A file-metadata command that prints the metadata of the file.
  • Docker files for building and testing on various Linux distributions.
  • Memory estimation for the reader.

Known issues:

  • CVE-2018-8015 ORC files with malformed types cause stack overflow.

  • ORC-10 When moving ORC files between timezones, different daylight savings rules will cause timestamps to shift in the C++ reader.