maxpmaxp / pdfreaderLinks
Python API for PDF documents
☆122Updated 9 months ago
Alternatives and similar repositories for pdfreader
Users that are interested in pdfreader are comparing it to the libraries listed below
Sorting:
- A Python tool to help extracting information from structured PDFs.☆404Updated 2 months ago
- Python binding to Poppler-cpp pdf library☆108Updated 8 months ago
- Python interface to Apache PDFBox command-line tools.☆75Updated 2 years ago
- A Python implementation of Lunr.js 🌖☆195Updated 2 months ago
- A utility to read and write PDFs with Python☆334Updated 3 years ago
- A better PDF Extraction Tool using the latest and fastest python features☆22Updated 10 months ago
- Parse numbers written in natural language☆116Updated 7 months ago
- A fast, comprehensive, ISO 639 library.☆38Updated 3 months ago
- Demos, examples and utilities using PyMuPDF☆663Updated 11 months ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆72Updated last year
- Pure-python library for adding annotations to PDFs☆202Updated 4 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆73Updated last month
- A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved.☆448Updated last year
- Python bindings to PDFium☆578Updated last week
- A curated list of resources around PDF files☆133Updated 10 months ago
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy data…☆93Updated 3 months ago
- Python 3 fork of pdfminer/pdfminer.six.☆46Updated 3 years ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆54Updated 5 months ago
- Create and modify Word documents with Python☆145Updated 11 months ago
- ☆171Updated 2 months ago
- Efficient string matching with regular expressions☆143Updated 2 weeks ago
- A modern CSS selector implementation for BeautifulSoup☆238Updated 2 weeks ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆153Updated last year
- Fast and memory-efficient Python PDF Parser based on xpdf sources☆42Updated last year
- A Python binding of SQLite Full Text Search Tokenizer☆48Updated last month
- mirror of https://hg.reportlab.com/hg-public/reportlab☆73Updated 3 weeks ago
- A utility to read and write PDFs with Python☆73Updated 10 months ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆468Updated 4 months ago
- Append/Concatenate .docx documents☆111Updated 10 months ago
- Extract dates from text☆64Updated 4 years ago