YinlinHu / pypdfiumLinks
A simple python wrapper for PDFium.
☆17Updated 3 years ago
Alternatives and similar repositories for pypdfium
Users that are interested in pypdfium are comparing it to the libraries listed below
Sorting:
- Parsing PDF files with PDFium☆12Updated 8 months ago
- Python binding to Poppler-cpp pdf library☆110Updated 10 months ago
- Python bindings to PDFium. Reasonably cross-platform.☆596Updated this week
- A Python tool to help extracting information from structured PDFs.☆407Updated 3 weeks ago
- A utility to read and write PDFs with Python☆335Updated 3 years ago
- Python API for PDF documents☆123Updated 10 months ago
- A tiny CSS parser☆178Updated 5 months ago
- ☆22Updated 8 months ago
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml☆84Updated 3 weeks ago
- An extendable docx file format parser and converter☆192Updated 2 months ago
- ☆522Updated 2 months ago
- Python binding to libpoppler-qt5☆43Updated last year
- Simple python wrapper to convert HTML to PDF with headless Chrome via selenium☆73Updated 6 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆395Updated 11 months ago
- A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved.☆450Updated last year
- Demos, examples and utilities using PyMuPDF☆669Updated last year
- Convert html to docx☆81Updated last year
- A HarfBuzz Python binding☆82Updated 2 weeks ago
- Fast and memory-efficient Python PDF Parser based on xpdf sources☆42Updated last year
- Pure-python library for adding annotations to PDFs☆204Updated 4 years ago
- A better PDF Extraction Tool using the latest and fastest python features☆22Updated 11 months ago
- Python interface to Apache PDFBox command-line tools.☆75Updated 2 years ago
- A Python binding of SQLite Full Text Search Tokenizer☆49Updated 2 months ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆207Updated last month
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆185Updated this week
- Read SVG files and convert them to other formats.☆343Updated last month
- ☆85Updated 2 months ago
- Complete lxml external type annotation☆63Updated 2 weeks ago
- A low-level PDF creator☆131Updated 8 months ago
- CFFI-based cairo bindings for Python.☆209Updated 7 months ago