Relative Content

Tag Archive for pythonpdflarge-language-model

Looking for ways to extract text from complicated PDF’s

I am currently developing a software that retrieves important data from a business pitch deck (tl:dr version). Part of this involves getting text out of a PDF. That part is simple enough but keeping the data LLM readable is not.