YinlinHu / pypdfium
A simple Python wrapper for PDFium.
☆17 Updated 3 years ago
Alternatives and similar repositories for pypdfium
Users who are interested in pypdfium compare it to the libraries listed below.
- Parsing PDF files with PDFium ☆12 Updated last year
- Python binding to Poppler-cpp pdf library ☆113 Updated last year
- A Python tool to help extracting information from structured PDFs. ☆422 Updated last week
- Python interface to Apache PDFBox command-line tools. ☆78 Updated 2 years ago
- An extendable docx file format parser and converter ☆193 Updated 6 months ago
- Simplify DOCX files to JSON ☆256 Updated last year
- Fast and memory-efficient Python PDF Parser based on xpdf sources ☆43 Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆403 Updated last year
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml ☆86 Updated last month
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python. ☆225 Updated last week
- Python API for PDF documents ☆125 Updated last year
- Convert html to docx ☆83 Updated last year
- A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved. ☆454 Updated 2 years ago
- Parallel and LAzY Analyzer for PDFs 🏖️ ☆35 Updated last week
- Read SVG files and convert them to other formats. ☆349 Updated last week
- Pure-python library for adding annotations to PDFs ☆209 Updated 4 years ago
- A tiny CSS parser ☆180 Updated 2 months ago
- Convert Word documents (.docx files) to HTML ☆1,022 Updated 2 months ago
- Pandoc (Python Library) ☆174 Updated last month
- Convert your vector images ☆877 Updated 6 months ago
- PDF to XML ALTO file converter ☆255 Updated last week
- Mail merge for Office Open XML (docx) files without the need for Microsoft Office Word. ☆75 Updated this week
- Complete lxml external type annotation ☆71 Updated this week
- Truly universal encoding detector in pure Python. ☆717 Updated last week
- mirror of https://hg.reportlab.com/hg-public/reportlab ☆75 Updated this week
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python. ☆326 Updated last year
- Annotation layer for pdf.js ☆289 Updated last year
- A utility to read and write PDFs with Python ☆338 Updated 3 years ago
- A Python binding of SQLite Full Text Search Tokenizer ☆48 Updated last month
- API for OpenDocument in Python ☆346 Updated 2 months ago