Spark first read from Cloud Storage is very slow on Dataproc Serverless
I’m running a simple Spark job on Dataproc Serverless, and the first step involves reading some CSV files from Cloud Storage in a standard way:
I’m running a simple Spark job on Dataproc Serverless, and the first step involves reading some CSV files from Cloud Storage in a standard way: