YinlinHu / pypdfiumLinks
A simple python wrapper for PDFium.
☆17Updated 3 years ago
Alternatives and similar repositories for pypdfium
Users that are interested in pypdfium are comparing it to the libraries listed below
Sorting:
- Parsing PDF files with PDFium☆12Updated 6 months ago
- Python binding to Poppler-cpp pdf library☆108Updated 8 months ago
- Python bindings to PDFium☆578Updated last week
- A Python binding of SQLite Full Text Search Tokenizer☆48Updated last month
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml☆82Updated 3 weeks ago
- Mail merge for Office Open XML (docx) files without the need for Microsoft Office Word.☆69Updated 5 months ago
- Convert omml to latex for displaying in web browsers (KaTeX)☆31Updated 4 years ago
- An extendable docx file format parser and converter☆191Updated 2 weeks ago
- A Python tool to help extracting information from structured PDFs.☆404Updated 2 months ago
- Python interface to Apache PDFBox command-line tools.☆75Updated 2 years ago
- Generalized nested html element tree with recursive rendering☆36Updated 2 years ago
- Pure-python library for adding annotations to PDFs☆202Updated 4 years ago
- Python binding to libpoppler-qt5☆43Updated last year
- A utility to read and write PDFs with Python☆73Updated 10 months ago
- Python API for PDF documents☆122Updated 9 months ago
- Complete lxml external type annotation☆59Updated 3 weeks ago
- A better PDF Extraction Tool using the latest and fastest python features☆22Updated 10 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆392Updated 9 months ago
- Data URI manipulation made easy.☆55Updated 5 months ago
- Convert html to docx☆79Updated 10 months ago
- A tiny CSS parser☆176Updated 4 months ago
- Stripping rtf to plain old text☆101Updated 2 months ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆54Updated 5 months ago
- Simple python wrapper to convert HTML to PDF with headless Chrome via selenium☆72Updated 5 months ago
- A step-by-step C# implementation of the Docstrum algorithm☆23Updated 4 years ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆207Updated 3 weeks ago
- ☆29Updated last week
- A curated list of resources around PDF files☆133Updated 10 months ago
- Pyfilesystem2 implementation for OneDrive☆10Updated this week
- Fast javascript minifier for Python☆68Updated 2 weeks ago