Relative Content

Tag Archive for python-3.xpysparkfeature-engineeringpandas-udf

Create time based features in Pyspark

I have a feature table Pyspark DF that gets created every day through a pipeline. Now the ask is to create time based features for each feature where each t-1 till t-30 (t=time) features captures the previous day value.