py-pdf / awesome-pdf
A curated list of resources around PDF files
☆129Updated 9 months ago
Alternatives and similar repositories for awesome-pdf
Users that are interested in awesome-pdf are comparing it to the libraries listed below
Sorting:
- CLI tool to extract (meta)data from PDF and manipulate PDF files☆145Updated last week
- Demos, examples and utilities using PyMuPDF☆656Updated 10 months ago
- OCRmyPDF EasyOCR plugin☆84Updated last month
- Logical structure analysis for visually structured documents☆89Updated 2 years ago
- 🖍️ Highlight text in documents☆107Updated 3 weeks ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆179Updated last week
- Extracting Semi-Structured Data from PDFs on a large scale☆51Updated 2 years ago
- Python library to extract tabular data from images and scanned PDFs☆278Updated 9 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆391Updated 9 months ago
- Docx tracked change redlines for the Python ecosystem.☆60Updated 10 months ago
- PDF to XML ALTO file converter☆237Updated this week
- A basic tool that extracts the structure from the PDF files of scientific articles.☆74Updated 3 years ago
- Adobe PDFServices python SDK Samples☆149Updated 6 months ago