Using LLM to extract images and description from PDF file
I’m working on a project now where we contemplate to use a LLM to extract product information and images from a pdf file.
Looking for ways to extract text from complicated PDF’s
I am currently developing a software that retrieves important data from a business pitch deck (tl:dr version). Part of this involves getting text out of a PDF. That part is simple enough but keeping the data LLM readable is not.