cbrunet / python-popplerLinks
Python binding to Poppler-cpp pdf library
☆113Updated last year
Alternatives and similar repositories for python-poppler
Users that are interested in python-poppler are comparing it to the libraries listed below
Sorting:
- A Python tool to help extracting information from structured PDFs.☆422Updated this week
- Python API for PDF documents☆124Updated last year
- Pandoc (Python Library)☆174Updated last month
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆121Updated 2 weeks ago
- A utility to read and write PDFs with Python☆338Updated 3 years ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆193Updated last week
- Python bindings to PDFium, reasonably cross-platform.☆668Updated this week
- Fast Base64 encoding/decoding in Python☆162Updated this week
- mirror of https://hg.reportlab.com/hg-public/reportlab☆75Updated last month
- A Python implementation of Lunr.js 🌖☆200Updated 8 months ago
- ☆551Updated last week
- Truly universal encoding detector in pure Python.☆716Updated this week
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy data…☆94Updated 8 months ago
- Simplify DOCX files to JSON☆254Updated last year
- Pure-python library for adding annotations to PDFs☆209Updated 4 years ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆225Updated this week
- Pure python implementation of identifying files based off their magic numbers☆219Updated 4 months ago
- A fast, comprehensive, ISO 639 library.☆44Updated 3 months ago
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml☆86Updated 3 weeks ago
- A Python library to sanitize/validate a string such as filenames/file-paths/etc.☆278Updated 5 months ago
- Read SVG files and convert them to other formats.☆349Updated this week
- Pure-Python full-text search library☆646Updated last year
- A streaming multipart parser for Python.☆446Updated 2 weeks ago
- Complete lxml external type annotation☆71Updated last week
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆362Updated 2 weeks ago
- Append/Concatenate .docx documents☆121Updated last year
- Parse numbers written in natural language☆123Updated last year
- A python based HTML to text conversion library, command line client and Web service.☆323Updated 3 weeks ago
- A modern CSS selector implementation for BeautifulSoup☆250Updated 2 months ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆55Updated 10 months ago