Pandas
How to install
Pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool. Since Pandas relies on pyarrow for ORC support, it is required.
pip3 install pandas==2.3.3
pip3 install pyarrow
How to write and read an ORC file
In [1]: import pandas as pd
In [2]: df = pd.DataFrame({"col1": [1, 2, 3], "col2": ["a", "b", None]})
In [3]: df.to_orc("test.orc")
In [4]: pd.read_orc("test.orc")
Out[4]:
col1 col2
0 1 a
1 2 b
2 3 None
In [5]: pd.read_orc("test.orc", columns=["col1"])
Out[5]:
col1
0 1
1 2
2 3
Pandas page provides more information.