maxpmaxp / pdfreader
Python API for PDF documents
☆122 Updated 9 months ago
Alternatives and similar repositories for pdfreader
Users who are interested in pdfreader compare it to the libraries listed below.
- A Python tool to help extract information from structured PDFs. ☆404 Updated this week
- Python binding to the Poppler-cpp PDF library ☆110 Updated 9 months ago
- Python interface to Apache PDFBox command-line tools. ☆75 Updated 2 years ago
- A Python implementation of Lunr.js 🌖 ☆197 Updated 3 months ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images. ☆183 Updated last week
- A Python library to extract tabular data from PDFs ☆65 Updated 2 months ago
- A utility to read and write PDFs with Python ☆334 Updated 3 years ago
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy data… ☆94 Updated 4 months ago
- Append/Concatenate .docx documents ☆114 Updated 10 months ago
- This project uses the SLICE algorithm to extract information from a text-based PDF page containing financial statements (tabular data). It ca… ☆64 Updated 3 years ago
- Easy rate-limiting for Python requests ☆102 Updated last week
- Convert HTML to docx ☆81 Updated 11 months ago
- A Python package for grapheme-aware string handling ☆112 Updated 3 years ago
- A fast, comprehensive ISO 639 library. ☆39 Updated 4 months ago
- Python library to simplify working with jsonlines and ndjson data ☆294 Updated 10 months ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python. ☆207 Updated last week
- Fast and memory-efficient Python PDF parser based on xpdf sources ☆42 Updated last year
- List of all countries with names and ISO 3166-1 codes in all languages. ☆26 Updated 2 months ago
- ☆85 Updated last month
- Parse numbers written in natural language ☆117 Updated 8 months ago
- A Python library for working with and comparing language codes. ☆345 Updated last month
- Simplify DOCX files to JSON ☆241 Updated 8 months ago
- Efficient string matching with regular expressions ☆143 Updated last week
- ☆171 Updated 2 months ago
- Pythonic search engine based on PyLucene. ☆128 Updated 7 months ago
- Library for unit extraction, a fork of quantulum for Python 3 ☆141 Updated last year
- ASCII transliterations of Unicode text (GitHub mirror) ☆569 Updated 2 months ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity ☆72 Updated last year
- Demos, examples and utilities using PyMuPDF ☆664 Updated 11 months ago
- URL normalization for Python ☆97 Updated last month