Relative Content

Tag Archive for pythonpandasparquetpyarrow

How can I extract data from parquet files using pyarrow?

I’m trying to perform data analysis on a large number of quite large parquet files. The analysis itself is relatively simple, but using e.g. pandas requires nested for loops to slice the data into increasingly small bites to then extract the data.