Binary Python bindings for poppler utils for content extraction
☆42May 12, 2021Updated 4 years ago
Alternatives and similar repositories for pdflib
Users that are interested in pdflib are comparing it to the libraries listed below
Sorting:
- dataeaze Enterprise Gen AI and Foundation Models product suite☆32Feb 11, 2026Updated 2 weeks ago
- Extract structured data from free text using large language models☆17Feb 13, 2026Updated 2 weeks ago
- React UI component library for aleph/followthemoney☆12Nov 22, 2022Updated 3 years ago
- A Python library for defining rule-based overrides on messy data☆18Nov 24, 2025Updated 3 months ago
- An alpha project combining beneficial ownership and contracting data☆13Jun 9, 2021Updated 4 years ago
- Django REST API extension enables chained relations, filters, field selectors, limit, offset, etc., via a single view.☆10Mar 31, 2021Updated 4 years ago
- A Python helper library to convert between ISO 639 two- and three-letter codes.☆11Nov 13, 2024Updated last year
- Bindings for Angular applications to use Cubes Slicer API☆12Nov 3, 2022Updated 3 years ago
- ☆57Oct 10, 2012Updated 13 years ago
- Provide partial dates and retain the date precision through processing☆14Aug 4, 2025Updated 6 months ago
- A data science enviornment for Ubuntu 14.04 server and desktop☆14Oct 1, 2020Updated 5 years ago
- JSONP support for Django REST Framework☆22Dec 26, 2022Updated 3 years ago
- provides a way to integrate a sphinx based documentation into your app.☆33Aug 5, 2019Updated 6 years ago
- Re-usable wrapper scripts for text document extractors.☆37Jun 18, 2016Updated 9 years ago
- Turn your Django project into RESTFul APIs in a minute.☆17Dec 8, 2015Updated 10 years ago
- Utility library to turn country names into ISO two-letter codes☆71Aug 4, 2025Updated 6 months ago
- Django @context decorator☆23Mar 7, 2024Updated last year
- Lightweight web scraping toolkit for documents and structured data.☆315Jan 10, 2024Updated 2 years ago
- Web app interface for geocoding addresses in CSV files.☆18May 21, 2018Updated 7 years ago
- ☆21Sep 9, 2012Updated 13 years ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆25Jul 15, 2025Updated 7 months ago
- Flame package for django suit.☆20Sep 7, 2015Updated 10 years ago
- The smart-match module contains functions for calculating strings/sets similarity.☆14Feb 22, 2024Updated 2 years ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆65Dec 19, 2025Updated 2 months ago
- Platform for journalists to search, analyse, categorise and share unstructured data☆58Updated this week
- Validate integrity of Amazon SNS messages☆18Jul 3, 2018Updated 7 years ago
- Transform flat data structures into nested object graphs matching JSON schema definitions.☆28Aug 9, 2016Updated 9 years ago
- Copy the contents of one SQL database to another☆27May 2, 2022Updated 3 years ago
- API client for Aleph, supports bulk entity and document upload.☆29Feb 18, 2026Updated last week
- Data model and processing tools for investigative entity data☆263Feb 20, 2026Updated last week
- Machine Learning Project Development Tool☆31Oct 30, 2025Updated 4 months ago
- Fork of original Django ROA lib. Now ROA works directly with an API like Django Rest Framework☆51Feb 17, 2015Updated 11 years ago
- PyRDM is a Python-based library for research data management (RDM). It facilitates the automated publication of scientific software and a…☆32Oct 30, 2021Updated 4 years ago
- Trying to generate name synonyms from wikidata☆35Jun 28, 2020Updated 5 years ago
- neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität …☆69Feb 13, 2019Updated 7 years ago
- Implementation of the SAN model for imparting gender privacy to face images☆63Jul 7, 2021Updated 4 years ago
- ☆10Jan 26, 2026Updated last month
- ☆12Sep 23, 2025Updated 5 months ago
- An image analysis tool for measuring microorganism colony growth☆12Dec 12, 2025Updated 2 months ago