cbrunet / python-popplerLinks
Python binding to Poppler-cpp pdf library
☆110Updated 10 months ago
Alternatives and similar repositories for python-poppler
Users that are interested in python-poppler are comparing it to the libraries listed below
Sorting:
- Python API for PDF documents☆124Updated 10 months ago
- A Python tool to help extracting information from structured PDFs.☆408Updated 2 weeks ago
- Truly universal encoding detector in pure Python☆676Updated this week
- A Python implementation of Lunr.js 🌖☆198Updated 4 months ago
- Pandoc (Python Library)☆161Updated 10 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆115Updated 4 months ago
- ☆522Updated 2 months ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆187Updated last week
- mirror of https://hg.reportlab.com/hg-public/reportlab☆74Updated last week
- A Python library to sanitize/validate a string such as filenames/file-paths/etc.☆265Updated last month
- Read SVG files and convert them to other formats.☆343Updated 2 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆340Updated 3 months ago
- A simple python wrapper for PDFium.☆17Updated 3 years ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆72Updated last year
- A fast, comprehensive, ISO 639 library.☆43Updated last week
- A utility to read and write PDFs with Python☆336Updated 3 years ago
- Pure-Python full-text search library☆633Updated last year
- A modern CSS selector implementation for BeautifulSoup☆244Updated this week
- Simple python wrapper to convert HTML to PDF with headless Chrome via selenium☆74Updated 7 months ago
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy data…☆94Updated 5 months ago
- A streaming multipart parser for Python.☆419Updated 3 months ago
- Pure python implementation of identifying files based off their magic numbers☆204Updated 3 weeks ago
- Python interface to Apache PDFBox command-line tools.☆75Updated 2 years ago
- Complete lxml external type annotation☆67Updated last week
- Pure-python library for adding annotations to PDFs☆204Updated 4 years ago
- SQLite3 DB-API 2.0 driver from Python 3, packaged separately, with improvements☆211Updated 3 months ago
- Convert html to docx☆81Updated last year
- Simplify DOCX files to JSON☆245Updated 10 months ago
- Append/Concatenate .docx documents☆121Updated last year
- python library to simplify working with jsonlines and ndjson data☆296Updated 11 months ago