Binary Python bindings for poppler utils for content extraction
☆42May 12, 2021Updated 5 years ago
Alternatives and similar repositories for pdflib
Users that are interested in pdflib are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Extract structured data from free text using large language models☆24Jun 22, 2026Updated last week
- A Python library for defining rule-based overrides on messy data☆18Nov 24, 2025Updated 7 months ago
- A Python helper library to convert between ISO 639 two- and three-letter codes.☆11Nov 13, 2024Updated last year
- An alpha project combining beneficial ownership and contracting data☆13Jun 9, 2021Updated 5 years ago
- Transform flat data structures into nested object graphs matching JSON schema definitions.☆28Aug 9, 2016Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- React UI component library for aleph/followthemoney☆12Nov 22, 2022Updated 3 years ago
- Re-usable wrapper scripts for text document extractors.☆37Jun 18, 2016Updated 10 years ago
- Bindings for Angular applications to use Cubes Slicer API☆12Nov 3, 2022Updated 3 years ago
- Provide partial dates and retain the date precision through processing☆14Aug 4, 2025Updated 10 months ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Jul 6, 2022Updated 3 years ago
- Demo Django app for IoT☆14Mar 30, 2016Updated 10 years ago
- Platform for journalists to search, analyse, categorise and share unstructured data☆59Jun 23, 2026Updated last week
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆27Jul 15, 2025Updated 11 months ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆66Dec 19, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Command-line tool for exploring the PAC donor-recipient relationship☆55Dec 18, 2014Updated 11 years ago
- Lightweight web scraping toolkit for documents and structured data.☆315May 20, 2026Updated last month
- Copy the contents of one SQL database to another☆27May 2, 2022Updated 4 years ago
- Learning String Alignments for Entity Aliases