How to de-duplicate images in PDF pages using pypdf?
I’m working on a Python script that merges PDF files, and I need help de-duplicating the images on each page. I’m using the pypdf library, and the problem is that the same image is detected multiple times on a page. I’d like behavior similar to Adobe Acrobat’s “Compress” feature, which keeps only one instance of each image per page.
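
To make the goal concrete, here is a minimal sketch of how the duplicates could be detected. It assumes that "the same image" means byte-identical image data, and the helper names `group_duplicate_images` and `duplicate_names` are hypothetical, not part of pypdf. It only reports duplicates; it does not rewrite the PDF.

```python
import hashlib
from typing import Dict, Iterable, List, Tuple


def group_duplicate_images(images: Iterable[Tuple[str, bytes]]) -> Dict[str, List[str]]:
    """Group image names by the SHA-256 digest of their raw bytes.

    Assumption: two images count as duplicates only if their bytes match exactly.
    """
    groups: Dict[str, List[str]] = {}
    for name, data in images:
        digest = hashlib.sha256(data).hexdigest()
        groups.setdefault(digest, []).append(name)
    return groups


def duplicate_names(images: Iterable[Tuple[str, bytes]]) -> List[str]:
    """Return the names of images that repeat an earlier image's bytes."""
    dupes: List[str] = []
    for names in group_duplicate_images(images).values():
        # The first name in each group is the instance to keep.
        dupes.extend(names[1:])
    return dupes
```

With pypdf, I would expect to feed this per page with something like `duplicate_names((img.name, img.data) for img in page.images)`, assuming each entry in `page.images` exposes `name` and `data`. Images that look the same but are encoded differently would not be caught by this approach.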