Infer `pyarrow.DataType` from Python type?
Consider the following example:
Infer `DataType` from Python type?
Consider the following example:
Get categories of arrow chunkedarray
If I have a PyArrow chunkedarray and want to know all its categories, I can go through each individual array, get the categories from there, and find the union:
Using Pyarrow, how do you group by a value, then select the top K largest values in that group?
I’m looking group by a PyArrow column, then within each of those groups select the top K values without using Pandas. For example, given this dataset: