ashutoshvarma / pyxpdfLinks
Fast and memory-efficient Python PDF Parser based on xpdf sources
☆44Updated 2 years ago
Alternatives and similar repositories for pyxpdf
Users that are interested in pyxpdf are comparing it to the libraries listed below
Sorting:
- Python API for PDF documents☆124Updated last year
- Python binding to Poppler-cpp pdf library☆113Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆123Updated 3 months ago
- Parse numbers written in natural language☆126Updated last year
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆76Updated 2 years ago
- Cython based high performance alternative to Python (re) module for doing basic pattern matching on large data-set..☆11Updated 3 years ago
- A Python tool to help extracting information from structured PDFs.☆427Updated 3 weeks ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆227Updated last month
- Python from the Nuitka project☆59Updated this week
- Pandoc (Python Library)☆178Updated 4 months ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆57Updated last year
- A Python implementation of Lunr.js 🌖☆204Updated 11 months ago
- A simple python wrapper for PDFium.☆17Updated 4 years ago
- A python module to split file into multiple chunks based on the given size.☆69Updated last year
- Convert HTML to JSON. Can also (intelligently) convert HTML tables to JSON (using table headers (if available) as keys in the resulting J…☆52Updated 2 years ago
- Find parts of long text or data, allowing for some changes/typos.☆339Updated 3 months ago
- Make PDFs easily☆324Updated 2 months ago
- mirror of https://hg.reportlab.com/hg-public/reportlab☆78Updated 3 weeks ago
- Porting Django's email implementation to your FastAPI applications.☆20Updated 2 months ago
- A Python binding of SQLite Full Text Search Tokenizer☆50Updated 2 months ago
- Friendlier Python tracebacks.☆88Updated 10 months ago
- Safely evaluate AST nodes without side effects☆50Updated last month
- CyDifflib is a fast implementation of difflib's algorithms, which can be used as a drop-in replacement.☆31Updated 10 months ago
- Fast Autocomplete: When Elastcsearch suggestions are not fast and flexible enough☆287Updated 5 months ago
- A high performance python hash table library that is generally faster and consumes significantly less memory than Python Dictionaries☆214Updated 2 years ago
- Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded s…☆142Updated 4 years ago
- Python3 GUI toolkit for building "beautiful" applications for mobile, web, and desktop from a single codebase☆109Updated last month
- Python Simple Dialogs☆41Updated 2 years ago
- Truly universal encoding detector in pure Python.☆735Updated last week
- A purely-functional HTML builder for Python. Think JSX rather than templates.☆102Updated last year