cbrunet / python-poppler
Python binding to the Poppler-cpp PDF library
☆97 Updated 2 months ago
Related projects
Alternatives and complementary repositories for python-poppler
- Python API for PDF documents ☆116 Updated 2 months ago
- A fast, simple ISO 639 library. ☆33 Updated 2 weeks ago
- A Python tool to help extracting information from structured PDFs. ☆379 Updated last week
- Python bindings to PDFium ☆419 Updated last week
- A modern CSS selector implementation for BeautifulSoup ☆205 Updated last month
- Python interface to Apache PDFBox command-line tools. ☆75 Updated last year
- Read SVG files and convert them to other formats. ☆323 Updated last week
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml ☆72 Updated last week
- A Python implementation of Lunr.js 🌖 ☆188 Updated last week
- Simple, Pythonic extraction of text, shapes and images from PDFs ☆78 Updated 4 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆96 Updated last week
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy data… ☆88 Updated 11 months ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images. ☆164 Updated this week
- Pure python implementation of identifying files based off their magic numbers ☆168 Updated last week
- Auto documentation for MkDocs 📘 ☆219 Updated 2 years ago
- Static analysis of Python import statements ☆118 Updated 3 weeks ago
- A pure Python Levenshtein implementation that's not freaking GPL'd. ☆97 Updated last year
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text. ☆49 Updated last week
- Fast and memory-efficient Python PDF Parser based on xpdf sources ☆40 Updated 10 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆149 Updated last year
- Parse natural language time expressions in python ☆131 Updated last year
- Pure-python library for adding annotations to PDFs ☆196 Updated 3 years ago
- A Python library for working with and comparing language codes. ☆339 Updated 7 months ago
- mirror of https://hg.reportlab.com/hg-public/reportlab ☆69 Updated 3 weeks ago
- Parse numbers written in natural language ☆109 Updated 2 weeks ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆67 Updated last week
- Python Powerful Timeout Decorator that can be used safely on classes, methods, class methods ☆152 Updated 4 months ago
- Serialization library for Exceptions and Tracebacks. ☆165 Updated 3 months ago
- Python package for Google's diff-match-patch native C++ implementation. ☆73 Updated 4 months ago
- Advanced Enumerations for Python ☆183 Updated 9 months ago