Relative Content

Tag Archive for pythonpandascsv

Convert dictionary column value to tabular format

I am new to Jupyter Notebook and am working on a data analysis project. I have a huge dataset in CSV format: the first column contains a numeric value, while the second column contains all the data in the form of a dictionary. The dataset is neither in tabular format nor in dictionary format. As can be seen in the image, the second column, i.e. trade_history, stores values in a dictionary format.
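One possible approach is sketched below. It assumes each trade_history cell holds a single Python-style dictionary stored as text; the sample rows, the `id` column name, and the `price`/`qty` keys are made up for illustration, since the real data is not shown. If a cell held a list of dictionaries instead, the expansion step would need to change.

```python
import ast
import io

import pandas as pd

# Hypothetical sample data: the real column names and dictionary keys are unknown.
raw = io.StringIO("""id,trade_history
1,"{'price': 10.5, 'qty': 3}"
2,"{'price': 11.0, 'qty': 1}"
""")

df = pd.read_csv(raw)

# read_csv loads each dictionary as a plain string, so parse it into a real dict.
parsed = df["trade_history"].apply(ast.literal_eval)

# Expand the dicts into one column per key, then attach them to the numeric column.
expanded = pd.json_normalize(parsed.tolist())
result = pd.concat([df.drop(columns="trade_history"), expanded], axis=1)

print(result)
```

`ast.literal_eval` is used rather than `eval` because it only accepts literals, which is safer for data read from a file.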

How do I search two files to know if the values are similar or not

I have two CSV files whose contents I want to compare, printing the entries that do not have the same data. Most entries will have the same name, but not all of their data is accurate. I want to print those whose data is not accurate. For example, an entry has the same name in both files, but its amount may or may not be the same. I want to highlight the ones whose amounts are not the same.
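A minimal sketch of one way to do this with pandas, assuming both files share a `name` column and an `amount` column (both column names and all sample rows are hypothetical). The two frames are merged on `name`, and rows whose amounts differ are kept:

```python
import io

import pandas as pd

# Hypothetical stand-ins for the two CSV files.
file_a = io.StringIO("name,amount\nalice,100\nbob,250\ncarol,75\n")
file_b = io.StringIO("name,amount\nalice,100\nbob,200\ndave,40\n")

a = pd.read_csv(file_a)
b = pd.read_csv(file_b)

# Outer merge keeps every name; indicator=True adds a "_merge" column
# saying whether a row came from the left file, the right file, or both.
merged = a.merge(b, on="name", how="outer", suffixes=("_a", "_b"), indicator=True)

# Names present in both files whose amounts disagree.
in_both = merged["_merge"] == "both"
mismatched = merged[in_both & (merged["amount_a"] != merged["amount_b"])]

# Names that appear in only one of the two files.
missing = merged[~in_both]

print(mismatched[["name", "amount_a", "amount_b"]])
print(missing[["name", "_merge"]])
```

Separating `mismatched` from `missing` keeps "same name, different amount" apart from "name not found in the other file", which are usually different problems.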

Extract data from a jumbled CSV file

I have a large, jumbled CSV file. To grab data from it I used a Python script. When I run the script, the following errors are thrown. How can I get rid of these errors and process the data file? The file has 10M records and is about 4 GB in size. I tried the same script without chunking on a small file and it works perfectly.
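The errors themselves are not shown, so the following is only a sketch of chunked reading under one guess: that some rows in the file have more fields than the header. The sample data and the `id`/`value` column names are hypothetical. `on_bad_lines="skip"` requires pandas 1.3 or newer.

```python
import io

import pandas as pd

# Hypothetical stand-in for the large file; the second data row has an extra field.
raw = io.StringIO("id,value\n1,10\n2,20,unexpected\n3,30\n4,40\n")

total = 0
rows = 0

# chunksize makes read_csv return an iterator of DataFrames instead of
# loading the whole file at once. on_bad_lines="skip" drops rows with
# too many fields rather than raising a parser error.
for chunk in pd.read_csv(raw, chunksize=2, on_bad_lines="skip"):
    total += chunk["value"].sum()
    rows += len(chunk)

print(rows, total)
```

Because each chunk is parsed separately, column types are also inferred per chunk; passing an explicit `dtype` to `read_csv` avoids surprises when a later chunk contains values the first one did not.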

Python (Pandas): change all values in a DataFrame column from other rows

I am currently working on a DataFrame that contains data about products. Some are regular products, some are variable products, and the third kind are variations of the latter. The product variations currently have no category information, but there is a “Parent” column that points to the “SKU” of the parent product. The DataFrame has a structure similar to this:
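The original table is not reproduced in this listing, so the frame below is a made-up stand-in. Only the "Parent" and "SKU" column names come from the question; the "Type" and "Category" columns and every value are assumptions. Under those assumptions, one way to copy each parent's category onto its variations is to build a SKU-to-category lookup and map the "Parent" column through it:

```python
import pandas as pd

# Hypothetical frame: "Type", "Category" and all values are illustrative assumptions.
df = pd.DataFrame({
    "SKU": ["A1", "B1", "B1-red", "B1-blue"],
    "Type": ["simple", "variable", "variation", "variation"],
    "Parent": [None, None, "B1", "B1"],
    "Category": ["Books", "Shirts", None, None],
})

# Lookup table from SKU to category (SKUs are assumed to be unique).
category_by_sku = df.set_index("SKU")["Category"]

# For each row, find the category of its parent; rows without a parent get NaN.
parent_category = df["Parent"].map(category_by_sku)

# Fill only the missing categories, leaving existing ones untouched.
df["Category"] = df["Category"].fillna(parent_category)

print(df)
```

Using `fillna` rather than plain assignment means products that already have a category keep it, and only the variations are updated.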