PySpark select after join raises an ambiguity error, but the column should be present in only one of the DataFrames
I’m doing a join on two DataFrames that are derived from the same original DataFrame. Each one goes through some aggregations, and the columns selected from them differ except for the ones used in the join.
How to change a row's value based on a value in the previous row, in a DataFrame ordered by date for each unique id?
I need some insight into how to do this in Spark: