YinlinHu / pypdfium
A simple Python wrapper for PDFium.
☆17 · Updated 3 years ago
Alternatives and similar repositories for pypdfium
Users who are interested in pypdfium are comparing it to the libraries listed below.
- Parsing PDF files with PDFium ☆12 · Updated 11 months ago
- Python binding to Poppler-cpp pdf library ☆113 · Updated last year
- Convert OMML to LaTeX for displaying in web browsers (KaTeX) ☆34 · Updated 5 years ago
- A Python tool to help extract information from structured PDFs. ☆417 · Updated last week
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆401 · Updated last year
- Python API for PDF documents ☆124 · Updated last year
- Read SVG files and convert them to other formats. ☆347 · Updated this week
- Fast and memory-efficient Python PDF Parser based on xpdf sources ☆42 · Updated last year
- A utility to read and write PDFs with Python ☆338 · Updated 3 years ago
- Library used to deskew a scanned document ☆489 · Updated 3 weeks ago
- A better PDF Extraction Tool using the latest and fastest Python features ☆22 · Updated last year
- Pure-Python library for adding annotations to PDFs ☆208 · Updated 4 years ago
- Parallel and LAzY Analyzer for PDFs 🏖️ ☆35 · Updated last month
- Tools for extracting figures, tables, text, etc. from a PDF document. ☆34 · Updated 4 years ago
- A Python binding of SQLite Full Text Search Tokenizer ☆48 · Updated last week
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python. ☆221 · Updated this week
- An extendable docx file format parser and converter ☆192 · Updated 5 months ago
- Convert your vector images ☆870 · Updated 5 months ago
- A tiny CSS parser ☆180 · Updated last month
- Tutorial on how to deskew (straighten) text images ☆52 · Updated 3 years ago
- A low-level PDF creator ☆138 · Updated 3 weeks ago
- ☆40 · Updated 5 years ago
- Python binding to libpoppler-qt5 ☆43 · Updated last year
- Convert HTML to docx ☆83 · Updated last year
- Community maintained hooks for PyInstaller. ☆111 · Updated last week
- Pandoc (Python Library) ☆173 · Updated 3 weeks ago
- Document image dewarping library using a cubic sheet model ☆175 · Updated last week
- Document Layout Analysis ☆391 · Updated this week
- ☆820 · Updated 3 weeks ago
- OnnxTR, a docTR (Document Text Recognition) library ONNX pipeline wrapper for seamless, high-performing & accessible OCR ☆158 · Updated last month