Lightweight web scraping toolkit for documents and structured data.
☆315Jan 10, 2024Updated 2 years ago
Alternatives and similar repositories for memorious
Users that are interested in memorious are comparing it to the libraries listed below
Sorting:
- Search and browse documents and data; find the people and companies you look for.☆2,323Feb 20, 2026Updated last week
- API client for Aleph, supports bulk entity and document upload.☆29Feb 18, 2026Updated last week
- An open database of international sanctions data, persons of interest and politically exposed persons☆684Updated this week
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆65Dec 19, 2025Updated 2 months ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆25Jul 15, 2025Updated 7 months ago
- Now included in rigour☆151Nov 24, 2025Updated 3 months ago
- Machine assisted dossiers☆19Oct 12, 2017Updated 8 years ago
- How can we improve name matching in screening tools?☆15Aug 13, 2025Updated 6 months ago
- Official repo documenting the closure of Sunlight Labs☆11Sep 28, 2016Updated 9 years ago
- Trying to generate name synonyms from wikidata☆35Jun 28, 2020Updated 5 years ago
- Provide partial dates and retain the date precision through processing☆14Aug 4, 2025Updated 6 months ago
- Binary Python bindings for poppler utils for content extraction☆42May 12, 2021Updated 4 years ago
- Extract networks of entities from journalistic reporting☆49Jul 17, 2023Updated 2 years ago
- An alpha project combining beneficial ownership and contracting data☆13Jun 9, 2021Updated 4 years ago
- Simple way to build a new dict based on fields declaration☆15May 7, 2019Updated 6 years ago
- Utility library to turn country names into ISO two-letter codes☆71Aug 4, 2025Updated 6 months ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Jul 6, 2022Updated 3 years ago
- International Address formatter which considers the standard formatting rules of the country☆13Nov 21, 2024Updated last year
- A re-useable, stand-alone version of LittleSis network storytelling tool☆12Jan 30, 2016Updated 10 years ago
- Loading OpenSanctions into Neo4J and Linkurious☆31Dec 17, 2024Updated last year
- Platform for journalists to search, analyse, categorise and share unstructured data☆58Updated this week
- Mixins for Django Rest Framework Serializer☆19Feb 18, 2019Updated 7 years ago
- Transistor, a Python web scraping framework for intelligent use cases.☆213Jan 29, 2026Updated last month
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆157Sep 12, 2025Updated 5 months ago
- Jiraya - Simple Jira CLI☆17Dec 13, 2019Updated 6 years ago
- Easily crowdsource the analysis of your documents☆102Nov 7, 2017Updated 8 years ago
- Extendable CMS for small news organizations following decoupled CMS design paradigm. Built on Django Rest Framework. Dynamic image resizi…☆17Mar 7, 2017Updated 8 years ago
- Data cleaning and validation functions for names, languages, identifiers, etc.☆56Feb 10, 2026Updated 2 weeks ago
- A cross-platform command line tool for parallelised content extraction and analysis.☆254Jan 21, 2026Updated last month
- A self‑hosted search engine for documents☆710Updated this week
- Versammlungen in Berlin: Konservieren historischer Daten.☆16Updated this week
- The data journalism platform with built in training☆311Dec 3, 2024Updated last year
- Provides rapidjson support with parser and renderer☆19Oct 7, 2023Updated 2 years ago
- ⛏ a library for scraping unreliable pages☆212Feb 20, 2026Updated last week
- Python library for MIME type parsing, normalisation and grouping.☆13Nov 13, 2024Updated last year
- Web scraping Page Objects core library☆104Jan 27, 2026Updated last month
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Dec 8, 2022Updated 3 years ago
- No hassle, just sending emails☆11Jan 29, 2025Updated last year
- A Python helper library to convert between ISO 639 two- and three-letter codes.☆11Nov 13, 2024Updated last year