maxpmaxp / pdfreader
Python API for PDF documents
☆113Updated 2 weeks ago
Related projects: ⓘ
- Python binding to Poppler-cpp pdf library☆95Updated last week
- A Python tool to help extracting information from structured PDFs.☆368Updated 3 weeks ago
- Python interface to Apache PDFBox command-line tools.☆75Updated last year
- A Python implementation of Lunr.js 🌖☆188Updated last week
- Python bindings to PDFium☆349Updated this week
- Stripping rtf to plain old text☆90Updated 3 weeks ago
- Pandoc (Python Library)☆135Updated last week
- A utility to read and write PDFs with Python☆330Updated 2 years ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆158Updated this week
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆148Updated last year
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆66Updated 2 weeks ago
- ☆159Updated 3 months ago
- A Python library for working with and comparing language codes.☆339Updated 5 months ago
- Fast and memory-efficient Python PDF Parser based on xpdf sources☆40Updated 9 months ago
- Demos, examples and utilities using PyMuPDF☆548Updated 2 months ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆63Updated 8 months ago
- A curated list of resources around PDF files☆89Updated last month
- Append/Concatenate .docx documents☆101Updated last month
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy data…☆86Updated 10 months ago
- mirror of https://hg.reportlab.com/hg-public/reportlab☆66Updated this week
- Efficient string matching with regular expressions☆139Updated last month
- Library for unit extraction - fork of quantulum for python3☆134Updated 2 months ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆46Updated last year
- A better PDF Extraction Tool using the latest and fastest python features☆22Updated last month
- A python package for grapheme aware string handling☆104Updated 2 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆425Updated 2 months ago
- SQLite3 DB-API 2.0 driver from Python 3, packaged separately, with improvements☆183Updated 3 months ago
- Guess gender from first name in Python 2 and 3☆129Updated 2 years ago
- ASCII transliterations of Unicode text - GitHub mirror☆517Updated 4 months ago
- A fast, simple ISO 639 library.☆32Updated 3 weeks ago