YinlinHu / pypdfium
A simple python wrapper for PDFium.
☆17Updated 3 years ago
Alternatives and similar repositories for pypdfium:
Users that are interested in pypdfium are comparing it to the libraries listed below
- Parsing PDF files with PDFium☆12Updated 5 months ago
- Python binding to Poppler-cpp pdf library☆109Updated 7 months ago
- Fast and memory-efficient Python PDF Parser based on xpdf sources☆42Updated last year
- A Python tool to help extracting information from structured PDFs.☆402Updated 2 weeks ago
- A better PDF Extraction Tool using the latest and fastest python features☆22Updated 8 months ago
- Tools for extract figure, table, text, .. from a pdf document.☆32Updated 4 years ago
- Mail merge for Office Open XML (docx) files without the need for Microsoft Office Word.☆67Updated 3 months ago
- Python interface to Apache PDFBox command-line tools.☆75Updated 2 years ago
- mirror of https://hg.reportlab.com/hg-public/reportlab☆72Updated last week
- Parallel and LAzY Analyzer for PDFs 🏖️☆26Updated this week
- A step-by-step C# implementation of the Docstrum algorithm☆23Updated 4 years ago
- Complete lxml external type annotation☆58Updated this week
- Python API for PDF documents☆119Updated 7 months ago
- A Python binding of SQLite Full Text Search Tokenizer☆47Updated 2 months ago
- A low-level PDF creator☆124Updated 5 months ago
- ☆16Updated 3 months ago
- A simpler, faster ISO 639 library.☆37Updated 2 months ago
- Detect textlines in document images☆92Updated 10 months ago
- ☆10Updated 4 years ago
- Pure-python library for adding annotations to PDFs☆201Updated 4 years ago
- Pipeline for converting PDFs to raw text with PaddleOCR☆22Updated last year
- Python binding to libpoppler-qt5☆43Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆389Updated 8 months ago
- DFKI Layout Detection for OCR-D☆47Updated last week
- Document Layout Analysis☆365Updated this week
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 2 weeks ago
- ☆20Updated 5 months ago
- 🎨 Type-safe and powerful Python library to generate SVG files☆304Updated last month
- python module to manipulate text, strings and list of strings☆20Updated 2 years ago
- OCR-D-compliant page segmentation☆67Updated last month