YinlinHu / pypdfiumLinks
A simple python wrapper for PDFium.
☆17Updated 4 years ago
Alternatives and similar repositories for pypdfium
Users that are interested in pypdfium are comparing it to the libraries listed below
Sorting:
- Parsing PDF files with PDFium☆12Updated last year
- Python binding to Poppler-cpp pdf library☆114Updated last year
- Python bindings to PDFium, reasonably cross-platform.☆704Updated this week
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆405Updated last year
- An extendable docx file format parser and converter☆194Updated 7 months ago
- Python interface to Apache PDFBox command-line tools.☆78Updated 2 years ago
- A Python tool to help extracting information from structured PDFs.☆427Updated 3 weeks ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆226Updated 2 weeks ago
- Library used to deskew a scanned document☆495Updated this week
- Python API for PDF documents☆124Updated last year
- A step-by-step C# implementation of the Docstrum algorithm☆24Updated 5 years ago
- Convert your vector images☆894Updated 7 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆370Updated 3 weeks ago
- Read SVG files and convert them to other formats.☆354Updated 2 weeks ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆335Updated last year
- Pure-Python full-text search library☆650Updated 2 years ago
- Pure-python library for adding annotations to PDFs☆212Updated 4 years ago
- Simplify DOCX files to JSON☆257Updated last year
- Truly universal encoding detector in pure Python.☆724Updated this week
- Fast and memory-efficient Python PDF Parser based on xpdf sources☆44Updated 2 years ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆197Updated 2 weeks ago
- Convert Word documents (.docx files) to HTML☆1,043Updated last month
- CFFI-based cairo bindings for Python.☆211Updated 3 weeks ago
- PDF to XML ALTO file converter☆259Updated last week
- Tools for extract figure, table, text, .. from a pdf document.☆34Updated 5 years ago
- ☆566Updated 2 months ago
- Convert omml to latex for displaying in web browsers (KaTeX)☆35Updated 5 years ago
- A Python binding of SQLite Full Text Search Tokenizer☆49Updated last month
- A utility to read and write PDFs with Python☆338Updated 4 years ago
- Python bindings for Tantivy☆381Updated last week