How the pdf I want to convert to text or even excel looks like
example
Hey guys, I’m trying to scrape some information a pdf file so I can organize it as a nice text format to feed to our chatbot. I’ve used PyPDF2, pdfplumber, tabula-py, and none of them worked. They were able to extract the information to text, but it’s literally extracting the text but not in a format that the chatbot would understand. Could you guide me some suggestions? Thank you!
New contributor