YinlinHu / pypdfium
A simple Python wrapper for PDFium.
☆17 · Updated 3 years ago
Alternatives and similar repositories for pypdfium
Users interested in pypdfium are comparing it to the libraries listed below.
- Parsing PDF files with PDFium ☆12 · Updated 6 months ago
- Python binding to Poppler-cpp pdf library ☆110 · Updated 8 months ago
- Python bindings to PDFium ☆568 · Updated this week
- Python binding to libpoppler-qt5 ☆43 · Updated last year
- Convert omml to latex for displaying in web browsers (KaTeX) ☆31 · Updated 4 years ago
- A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved. ☆444 · Updated last year
- A low-level PDF creator ☆125 · Updated 5 months ago
- Python interface to Apache PDFBox command-line tools. ☆75 · Updated 2 years ago
- Fast and memory-efficient Python PDF Parser based on xpdf sources ☆42 · Updated last year
- Python CFFI wrapper for LibreOfficeKit ☆56 · Updated 5 years ago
- Read SVG files and convert them to other formats. ☆341 · Updated 4 months ago
- A Python tool to help extracting information from structured PDFs. ☆403 · Updated last month
- uchardet is an encoding detector library, which takes a sequence of bytes in an unknown character encoding and attempts to determine the … ☆44 · Updated 11 months ago
- Pure-python library for adding annotations to PDFs ☆202 · Updated 4 years ago
- A Python binding of SQLite Full Text Search Tokenizer ☆48 · Updated 2 weeks ago
- A fast, comprehensive, ISO 639 library. ☆38 · Updated 2 months ago
- mirror of https://hg.reportlab.com/hg-public/reportlab ☆72 · Updated this week
- Python API for PDF documents ☆121 · Updated 8 months ago
- Files which can be used to test PDF readers ☆39 · Updated last month
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml ☆81 · Updated this week
- A step-by-step C# implementation of the Docstrum algorithm ☆23 · Updated 4 years ago
- A tiny CSS parser ☆176 · Updated 3 months ago
- Collection of OCR-related python tools and wrappers from @OCR-D ☆128 · Updated last week
- Tutorial on how to deskew (straighten) text images ☆51 · Updated 3 years ago
- Command-line tool for exploring and diagnosing problems with Microsoft Office Open XML files (.docx, .pptx, .xlsx) ☆53 · Updated 7 months ago
- Python 3 bindings for SQLCipher ☆105 · Updated 6 months ago
- Python library for parsing .docx (Office Open XML) files ☆51 · Updated 5 years ago
- Tools for extracting figures, tables, text, etc. from a PDF document. ☆32 · Updated 4 years ago
- Pyfilesystem2 implementation for OneDrive ☆10 · Updated last month
- A better PDF Extraction Tool using the latest and fastest python features ☆22 · Updated 9 months ago