Scraping table-like data from a PDF file

[Image: example of the PDF I want to convert to text or even Excel]

Hey guys, I’m trying to scrape some information from a PDF file so I can organize it into a clean text format to feed to our chatbot. I’ve tried PyPDF2, pdfplumber, and tabula-py, and none of them gave me what I need. They do extract the text, but only as raw text: the table structure is lost, so the output isn’t in a format the chatbot can understand. Could you suggest an approach? Thank you!
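For reference, here is a minimal sketch of one possible direction. It assumes the PDF holds real, text-based tables whose first row is a header (the actual layout is only visible in the image, so this is a guess). It uses pdfplumber's `extract_tables()` to get rows of cells, then flattens each row into a `header: value` line, which tends to be easier for a chatbot to consume than raw text. The helper names `rows_to_records` and `pdf_tables_to_text` are made up for illustration.

```python
from typing import List, Optional

Row = List[Optional[str]]


def _clean(cell: Optional[str]) -> str:
    """Normalize a cell: None becomes '', line breaks inside a cell become spaces."""
    return (cell or "").replace("\n", " ").strip()


def rows_to_records(rows: List[Row]) -> List[str]:
    """Turn one table into 'header: value; header: value' lines.

    Assumption: the first row is the header row. Empty rows are skipped.
    """
    if not rows:
        return []
    header = [_clean(h) for h in rows[0]]
    records = []
    for row in rows[1:]:
        cells = [_clean(c) for c in row]
        pairs = [f"{h}: {c}" for h, c in zip(header, cells) if h and c]
        if pairs:
            records.append("; ".join(pairs))
    return records


def pdf_tables_to_text(path: str) -> str:
    """Extract every table pdfplumber detects and flatten it to text lines."""
    # Third-party dependency, imported here so rows_to_records works without it.
    import pdfplumber

    lines: List[str] = []
    with pdfplumber.open(path) as pdf:
        for page in pdf.pages:
            # extract_tables() returns a list of tables; each table is a
            # list of rows, and each row is a list of cell strings (or None).
            for table in page.extract_tables():
                lines.extend(rows_to_records(table))
    return "\n".join(lines)
```

Calling `pdf_tables_to_text("report.pdf")` (the file name is a placeholder) would return one line per table row. If the tables have no ruling lines, or the PDF is a scanned image, table detection may fail and a different strategy such as tuned table settings or OCR would be needed.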
