ashutoshvarma / pyxpdf
Fast and memory-efficient Python PDF Parser based on xpdf sources
☆40Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for pyxpdf
- Python API for PDF documents☆116Updated 2 months ago
- Python binding to Poppler-cpp pdf library☆97Updated 2 months ago
- Loadable spellfix1 extension for sqlite as python package☆25Updated 6 months ago
- ☆44Updated 2 months ago
- Easy to use pattern matching and information extraction for Python☆38Updated 11 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆67Updated last week
- Visual Automata is a Python 3 library built as a wrapper for the Automata library to add more visualization features.☆56Updated last year
- Safely evaluate AST nodes without side effects☆42Updated 3 months ago
- Easy creation of custom import hooks to experiment on alternatives to Python's syntax; see https://aroberge.github.io/ideas/docs/html/☆79Updated 10 months ago
- WASM-powered sandbox implementation of exec() for safely running dynamic Python code☆32Updated 9 months ago
- Custom Python functions for working with SQLite FTS4☆22Updated 2 years ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆64Updated 10 months ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 2 months ago
- User contributed (non Google) OCR models for Tesseract☆22Updated 2 weeks ago
- Lightweight pure-Python package to show simple dialogs.☆21Updated 4 years ago
- A python package for grapheme aware string handling☆108Updated 2 years ago
- Python-based drawing tool for making sketches of mathematical and scientific problems.☆28Updated this week
- A purely-functional HTML builder for Python. Think JSX rather than templates.☆93Updated 3 months ago
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml☆72Updated last week
- A linting/refactoring library for python best practices and lesser-known tricks☆30Updated 2 years ago
- Fastest general-purpose parsing library for Python with a familiar API☆43Updated 2 months ago
- Parse numbers written in natural language☆109Updated 2 weeks ago
- Efficient string matching with regular expressions☆138Updated this week
- Faster, modernized fork of the language identification tool langid.py☆48Updated 4 months ago
- python module to manipulate text, strings and list of strings☆16Updated 2 years ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database…☆53Updated last year
- A Python binding of SQLite Full Text Search Tokenizer☆45Updated last month
- Launch HTML5 apps in the browser or a desktop-like runtime.☆43Updated 3 years ago
- Python difflib with parts reimplemented in C☆32Updated 2 years ago
- pdfrw is a pure Python library that reads and writes PDFs☆30Updated 2 years ago