cbrunet / python-poppler
Python binding to Poppler-cpp pdf library
β107Updated 6 months ago
Alternatives and similar repositories for python-poppler:
Users that are interested in python-poppler are comparing it to the libraries listed below
- Python API for PDF documentsβ118Updated 6 months ago
- A Python implementation of Lunr.js πβ196Updated last week
- Python bindings to PDFiumβ547Updated this week
- Pandoc (Python Library)β150Updated 6 months ago
- A Python tool to help extracting information from structured PDFs.β399Updated 2 weeks ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarityβ69Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ112Updated 2 weeks ago
- Python interface to Apache PDFBox command-line tools.β75Updated 2 years ago
- mirror of https://hg.reportlab.com/hg-public/reportlabβ72Updated this week
- Convert html to docxβ77Updated 8 months ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.β52Updated 2 months ago
- Python Dependency Graphsβ78Updated last year
- A simpler, faster ISO 639 library.β36Updated last month
- Fast and memory-efficient Python PDF Parser based on xpdf sourcesβ41Updated last year
- Parse numbers written in natural languageβ109Updated 4 months ago
- A utility to read and write PDFs with Pythonβ72Updated 8 months ago
- Pure python implementation of identifying files based off their magic numbersβ182Updated 4 months ago
- Type stubs for the lxml packageβ46Updated 9 months ago
- Integer to Roman numerals converterβ46Updated 2 months ago
- A Python library for working with and comparing language codes.β344Updated 3 months ago
- A low-level PDF creatorβ123Updated 4 months ago
- An in-memory database of Python objects, searchable using quasi-SQL APIβ165Updated 2 weeks ago
- Document Layout Analysisβ360Updated this week
- Append/Concatenate .docx documentsβ107Updated 7 months ago
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy dataβ¦β91Updated last month
- Auto documentation for MkDocs πβ225Updated 2 years ago
- β84Updated 3 weeks ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.β178Updated this week
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.β389Updated 7 months ago
- Python package for Google's diff-match-patch native C++ implementation.β74Updated 9 months ago