Relative Content

Tag Archive for pythonapache-sparkpyspark

Pyspark SQL not spliting column

I was trying to split my column using pyspark sql based on the values that are stored in another column, I saw that it worked for some specific values but for some other this is not working.

PySpark NOT_COLUMN_OR_STR Exception on Disconnected List

I am getting an odd pyspark exception when attempting to use filter and lambda functions on a list of ints I’ve collected from a pyspark dataframe, which makes no sense as the data exists in memory as a list and should be completely disconnected from pyspark. Here is the scenario.