innodatalabs / redstorkLinks
Parsing PDF files with PDFium
☆12Updated last year
Alternatives and similar repositories for redstork
Users that are interested in redstork are comparing it to the libraries listed below
Sorting:
- A simple python wrapper for PDFium.☆17Updated 4 years ago
- Python API for PDF documents☆124Updated last year
- A Python binding of SQLite Full Text Search Tokenizer☆50Updated 2 months ago
- Python binding to Poppler-cpp pdf library☆113Updated last year
- A low-level PDF creator☆140Updated this week
- CFFI-based cairo bindings for Python.☆211Updated 2 months ago
- A library for working with HTML/CSS color formats in Python.☆172Updated 3 months ago
- Allowlist-based HTML cleaner☆153Updated 7 months ago
- A tiny CSS parser☆183Updated 2 months ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆344Updated last year
- Convert html to docx☆87Updated last year
- ☆86Updated 8 months ago
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml☆88Updated 3 weeks ago
- A python package for grapheme aware string handling☆115Updated 3 years ago
- Find and use proxy auto-config (PAC) files with Python and Requests.☆74Updated 4 months ago
- An extendable docx file format parser and converter☆195Updated 8 months ago
- xmlsjon converts XML into Python dictionary structures (trees, like in JSON) and vice-versa.☆124Updated 8 months ago
- Postmark library for python 2 and 3. Built on top of the requests library.☆24Updated 2 years ago
- Complete lxml external type annotation☆79Updated last week
- Python module for interacting with nested dicts as a single level dict with delimited keys.☆115Updated 2 years ago
- a Python implementation of the Unicode Collation Algorithm☆224Updated last year
- A HarfBuzz Python binding☆90Updated 2 weeks ago
- Python package for Google's diff-match-patch native C++ implementation.☆87Updated last year
- The PyICU project repository has moved to https://pyicu.org.☆138Updated 4 years ago
- A fast, comprehensive, ISO 639 library.☆47Updated 6 months ago
- Python implementation of core ProseMirror modules☆55Updated 2 months ago
- Python CSS-to-inline-styles conversion tool for HTML using BeautifulSoup and cssutils☆183Updated 5 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆68Updated 2 years ago
- Read SVG files and convert them to other formats.☆358Updated this week
- Parallel and LAzY Analyzer for PDFs 🏖️☆38Updated last week