Efficiently Split a Large CSV File into Smaller Files by Index Range Without Loading Entire File into RAM
I have a large CSV file (60 GB) that does not fit into RAM. The first column contains a sorted index that ranges from 2000 to 2999, and each index value can repeat across many rows. I want to split the file into 10 files of approximately 6 GB each, but without “splitting” any index value between two of the files, i.e. all rows sharing an index must end up in the same output file.
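To make the requirement concrete, here is a minimal sketch of one way this could be done in a single streaming pass, so that only one line is held in memory at a time. It rests on several assumptions that are not stated above: the delimiter is a comma, no quoted field contains an embedded newline, and the index is the raw text before the first comma. The function name `split_csv_by_index`, the `part_NN.csv` naming scheme, and the `has_header` flag are hypothetical, chosen only for illustration.

```python
import os


def split_csv_by_index(src_path, out_dir, n_parts=10, has_header=False):
    """Stream src_path once and write at most n_parts files of roughly equal size.

    A new output file is started only when the index (first column) changes,
    so rows sharing an index value are never spread across two files.
    Assumes comma-delimited rows with no embedded newlines in quoted fields.
    """
    # Approximate number of bytes each output file should hold.
    target = os.path.getsize(src_path) / n_parts
    os.makedirs(out_dir, exist_ok=True)

    paths = []       # output files created so far
    out = None       # currently open output file
    written = 0      # data bytes written to the current output file
    prev_key = None  # index value of the previous row
    header = b""

    with open(src_path, "rb") as src:
        if has_header:
            header = src.readline()  # repeated at the top of every part
        for line in src:
            key = line.split(b",", 1)[0]
            # Roll over only at an index boundary, once the file is large enough.
            if out is None or (key != prev_key and written >= target):
                if out is not None:
                    out.close()
                path = os.path.join(out_dir, f"part_{len(paths):02d}.csv")
                paths.append(path)
                out = open(path, "wb")
                out.write(header)
                written = 0
            out.write(line)
            written += len(line)
            prev_key = key

    if out is not None:
        out.close()
    return paths
```

A call such as `split_csv_by_index("big.csv", "parts", n_parts=10)` (file names are placeholders) would return the list of paths it wrote. Because a file is closed only after it reaches the target size and the index then changes, each part can overshoot 6 GB by up to the size of one index group, and the final part may be smaller; fewer than 10 files may be produced if the overshoots add up.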