asosnovsky / pdfmajor
A better PDF Extraction Tool using the latest and fastest python features
☆22Updated 6 months ago
Alternatives and similar repositories for pdfmajor:
Users that are interested in pdfmajor are comparing it to the libraries listed below
- Pandoc (Python Library)☆147Updated 5 months ago
- A Python library to load structured table data from files/strings/URL with various data format: CSV / Excel / Google-Sheets / HTML / JSON…☆107Updated last year
- Declare multi-table rules for SQLAlchemy update logic -- 40X more concise, Python for extensibility.☆45Updated this week
- Convert PDFs to high quality PNGs using PDFIUM☆10Updated 3 years ago
- A simple python wrapper for PDFium.☆16Updated 3 years ago
- mirror of https://hg.reportlab.com/hg-public/reportlab☆71Updated last week
- An implementation of DMN (Decision Model Notation) in Python☆41Updated 2 years ago
- pdfrw is a pure Python library that reads and writes PDFs☆30Updated 2 years ago
- Python library for extracting text from various file formats (for indexing).☆111Updated 3 years ago
- A Python tool to help extracting information from structured PDFs.☆394Updated this week
- Segno plugin to convert (Micro) QR Codes to Pillow/PIL☆36Updated last year
- python module to manipulate text, strings and list of strings☆17Updated 2 years ago
- Python binding to libpoppler with focus on text extraction☆97Updated 3 years ago
- Python binding to Poppler-cpp pdf library☆105Updated 5 months ago
- Python API for PDF documents☆118Updated 5 months ago
- Fast, lightweight Python database toolkit for SQLite, built with Cython.☆42Updated last year
- Modern internal tools. Defined, controlled, and deployed directly from backend code. No JavaScript. Secure.☆20Updated 3 years ago
- Regular Expression based parsers for extracting data from natural languages☆70Updated 7 years ago
- PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz☆38Updated 11 months ago
- 🔏 People-centric secret management system, built to work with modern distributed version control systems.☆52Updated 2 years ago
- ☆37Updated 4 months ago
- Data driven report builder for the Python data ecosystem.☆87Updated last year
- A better static site generator.☆21Updated 10 months ago
- Next generation email box manager☆103Updated last year
- A Python Signal-Slot library inspired by Qt, featuring thread-safe communication, async support, and automatic connection type detection.…☆24Updated last month
- WASM-powered sandbox implementation of exec() for safely running dynamic Python code☆32Updated last year
- Define your JSON schema as Python dataclasses☆63Updated last year
- An intelligent OCR to detect tables and pure text inside PDFs and obtaing a csv file and a txt from it☆14Updated 6 years ago
- Charts with pure python☆57Updated 11 months ago
- Python library to make storing files simple☆27Updated this week