The ORC team is excited to announce the release of ORC v2.0.1.

The improvements (tools):

  • ORC-1644 Add merge tool to merge multiple ORC files into a single ORC file
  • ORC-1647 Tips for supporting ORC in the convert command
  • ORC-1667 Add check tool to check the index of the specified column

The bug fixes:

  • ORC-1646 Close the reader when reading the schema with the convert command
  • ORC-1654 [C++] Count up EvaluatedRowGroupCount correctly
  • ORC-1684 [C++] Find tzdb without TZDIR when in conda-environments
  • ORC-1688 [C++] Do not access TZDB if there is no timestamp type
  • ORC-1696 Fix ClassCastException when reading avro decimal type in benchmark

The tasks:

  • ORC-1649 [C++][Conan] Add 2.0.0 to conan recipe and update release guide
  • ORC-1669 [C++] Deprecate HDFS support
  • ORC-1686 [C++] Avoid using std::filesystem

The test changes:

  • ORC-1648 Add test to convert ORC in the convert command
  • ORC-1663 [C++] Enable TestTimezone.testMissingTZDB on Windows
  • ORC-1672 Remove test packages o.a.o.tools.check
  • ORC-1673 Remove test packages o.a.o.tools.[count|merge|sizes]
  • ORC-1676 Use Hive 4.0.0 in benchmark
  • ORC-1681 Remove redundant import statement in tests to fix checkstyle failures
  • ORC-1699 Fix SparkBenchmark in Parquet format according to SPARK-40918
  • ORC-1704 Migration to Scala 2.13 of Apache Spark 3.5.1 at SparkBenchmark
  • ORC-1707 Fix sun.util.calendar IllegalAccessException when SparkBenchmark runs on JDK17
  • ORC-1708 Support data/compress options in Hive benchmark

The build and dependency changes:

The documentation changes:

  • ORC-1668 Add merge command to Java tools documentation