Relative Content

Tag Archive for pysparkazure-databricks

Pyspark create dataframe in Databricks

Why it is not mandatory to import or create spark session while creating a dataframe in Databricks notebook using pyspark.
Can someone please explain if this is mandatory or not.

copy file structure including files from one storage to another incrementally using pyspark

I have a storage account dexflex and two containers source and destination.
Source container has directory and files as below:
results
search
03
Module19111.json
Module19126.json
04
Module11291.json
Module19222.json
product
03
Module18867.json
Module182625.json
04
Module122251.json
Module192287.json
i am trying to copy the data incrementally from source to destination container by using the below code snippet