ashutoshvarma / pyxpdf
Fast and memory-efficient Python PDF Parser based on xpdf sources
☆42 · Updated last year
Alternatives and similar repositories for pyxpdf
Users who are interested in pyxpdf are comparing it to the libraries listed below.
- Python binding to Poppler-cpp pdf library ☆110 · Updated 8 months ago
- Python difflib with parts reimplemented in C ☆38 · Updated 4 months ago
- Python API for PDF documents ☆121 · Updated 8 months ago
- Loadable spellfix1 extension for sqlite as python package ☆26 · Updated last year
- A simple python wrapper for PDFium. ☆17 · Updated 3 years ago
- Fastest general-purpose parsing library for Python with a familiar API ☆44 · Updated 3 months ago
- CyDifflib is a fast implementation of difflib's algorithms, which can be used as a drop-in replacement. ☆23 · Updated last month
- A simple Python wrapper around QOI (https://github.com/phoboslab/qoi) ☆79 · Updated 3 months ago
- A purely-functional HTML builder for Python. Think JSX rather than templates. ☆98 · Updated 4 months ago
- Efficient string matching with regular expressions ☆143 · Updated last week
- python module to manipulate text, strings and list of strings ☆20 · Updated 3 years ago
- Python module for accessing databases using the ODBC API. ☆11 · Updated 9 months ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python. ☆203 · Updated last month
- Custom Python functions for working with SQLite FTS4 ☆22 · Updated 2 years ago
- mirror of https://hg.reportlab.com/hg-public/reportlab ☆72 · Updated this week
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity ☆71 · Updated last year
- Quickly check whether there is a visible difference between two PDFs. ☆68 · Updated last month
- Faster, modernized fork of the language identification tool langid.py ☆55 · Updated 5 months ago
- convtools is a specialized Python library for dynamic, declarative data transformations with automatic code generation ☆40 · Updated last month
- A fast RLock implementation for CPython ☆28 · Updated 4 months ago
- A pure Python Levenshtein implementation that's not freaking GPL'd. ☆97 · Updated 2 years ago
- A Pure Python PDFViewer, which provides functionalities same as other famous PDFViewers. ☆83 · Updated last year
- Parse numbers written in natural language ☆114 · Updated 6 months ago
- A better PDF Extraction Tool using the latest and fastest python features ☆22 · Updated 9 months ago
- Easy to use pattern matching and information extraction for Python ☆40 · Updated last year
- Advanced multiple dispatch for Python functions ☆28 · Updated 2 weeks ago
- Yet another Python web framework ☆44 · Updated 7 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆114 · Updated 2 months ago
- MetaDict is a powerful dict subclass enabling (nested) attribute-style item access/assignment and IDE autocompletion support. ☆35 · Updated 2 years ago
- Easy and extensible query parser ☆30 · Updated last year