cbrunet / python-poppler
Python binding to Poppler-cpp pdf library
☆105Updated 5 months ago
Alternatives and similar repositories for python-poppler:
Users that are interested in python-poppler are comparing it to the libraries listed below
- Python API for PDF documents☆118Updated 5 months ago
- mirror of https://hg.reportlab.com/hg-public/reportlab☆71Updated this week
- A simpler, faster ISO 639 library.☆34Updated 3 months ago
- A Python tool to help extracting information from structured PDFs.☆394Updated this week
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆107Updated last month
- Python interface to Apache PDFBox command-line tools.☆75Updated 2 years ago
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml☆76Updated last month
- A modern CSS selector implementation for BeautifulSoup☆224Updated this week
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆51Updated last month
- A python package for grapheme aware string handling☆110Updated 2 years ago
- Fast and memory-efficient Python PDF Parser based on xpdf sources☆41Updated last year
- Advanced Enumerations for Python☆189Updated last year
- Asynchronous version of functions of shutil module.☆37Updated 6 months ago
- Convert html to docx☆76Updated 7 months ago
- Library used to deskew a scanned document☆436Updated this week
- The private PyPI server powered by flexible backends.☆180Updated 4 years ago
- Backport of PEP 654 (exception groups)☆42Updated this week
- (not ready yet) A simple but powerful job scheduler for Trio programs☆66Updated 4 years ago
- Parsing PDF files with PDFium☆12Updated 3 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆296Updated last month
- A Python implementation of Lunr.js 🌖☆196Updated last month
- Append/Concatenate .docx documents☆106Updated 6 months ago
- Pandoc (Python Library)☆147Updated 5 months ago
- Collection of OCR-related python tools and wrappers from @OCR-D☆125Updated this week
- Parse numbers written in natural language☆109Updated 3 months ago
- Python difflib with parts reimplemented in C☆33Updated last month
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆381Updated 6 months ago
- Pure python implementation of identifying files based off their magic numbers☆180Updated 3 months ago
- A simple python wrapper for PDFium.☆16Updated 3 years ago
- Schema.org classes in pydantic☆65Updated 2 years ago