Lightweight web scraping toolkit for documents and structured data.
☆315May 20, 2026Updated last month
Alternatives and similar repositories for memorious
Users that are interested in memorious are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data model and processing tools for investigative entity data☆275Feb 28, 2026Updated 4 months ago
- Search and browse documents and data; find the people and companies you look for.☆2,386Feb 20, 2026Updated 4 months ago
- DEPRECATED. Desktop graph visualization application☆51Sep 30, 2022Updated 3 years ago
- API client for Aleph, supports bulk entity and document upload.☆30Mar 5, 2026Updated 3 months ago
- An open database of international sanctions data, persons of interest and politically exposed persons☆755Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆66Dec 19, 2025Updated 6 months ago
- Machine assisted dossiers☆19Oct 12, 2017Updated 8 years ago
- A re-useable, stand-alone version of LittleSis network storytelling tool☆12Jan 30, 2016Updated 10 years ago
- An alpha project combining beneficial ownership and contracting data☆13Jun 9, 2021Updated 5 years ago
- Now included in rigour☆150Nov 24, 2025Updated 7 months ago
- How can we improve name matching in screening tools?☆16Aug 13, 2025Updated 10 months ago
- Loading OpenSanctions into Neo4J and Linkurious☆31Dec 17, 2024Updated last year
- A visualisation library for beneficial ownership structures☆28Mar 29, 2026Updated 3 months ago
- Trying to generate name synonyms from wikidata☆35Jun 28, 2020Updated 6 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources☆249Updated this week
- Extract networks of entities from journalistic reporting☆49Jul 17, 2023Updated 2 years ago
- Official repo documenting the closure of Sunlight Labs☆11Sep 28, 2016Updated 9 years ago
- Provide partial dates and retain the date precision through processing☆14Aug 4, 2025Updated 10 months ago
- Data cleaning and validation functions for names, languages, identifiers, etc.☆63Updated this week
- A Python library for defining rule-based overrides on messy data☆18Nov 24, 2025Updated 7 months ago
- Binary Python bindings for poppler utils for content extraction☆42May 12, 2021Updated 5 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆157Mar 8, 2026Updated 3 months ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Jul 6, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Utility library to turn country names into ISO two-letter codes☆71May 29, 2026Updated last month
- Versammlungen in Berlin: Konservieren historischer Daten.☆17Updated this week
- Platform for journalists to search, analyse, categorise and share unstructured data☆59Updated this week
- Commons of stupid, simple Python micro functions. Pull requests very welcome.☆21Jun 20, 2026Updated last week
- Transform flat data structures into nested object graphs matching JSON schema definitions.☆28Aug 9, 2016Updated 9 years ago
- Transistor, a Python web scraping framework for intelligent use cases.☆211Jan 29, 2026Updated 5 months ago
- A self‑hosted search engine for documents☆742Updated this week
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.☆151Updated this week
- A command line utility for listing and searching snapshots in web archives☆18Jun 4, 2026Updated 3 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A cross-platform command line tool for parallelised content extraction and analysis.☆256Updated this week
- Simple way to build a new dict based on fields declaration☆15May 7, 2019Updated 7 years ago
- The data journalism platform with built in training☆312Dec 3, 2024Updated last year
- The OpenTrials API service + database schema definition.☆12Nov 18, 2018Updated 7 years ago
- Crawler that collects and extracts content of daily published news articles☆14Feb 18, 2023Updated 3 years ago
- Data about every national legislature in the world, freely available for you to use☆21Sep 18, 2017Updated 8 years ago
- Easily crowdsource the analysis of your documents☆102Nov 7, 2017Updated 8 years ago