Platform for journalists to search, analyse, categorise and share unstructured data
☆58Feb 26, 2026Updated this week
Alternatives and similar repositories for giant
Users that are interested in giant are comparing it to the libraries listed below
Sorting:
- Extract networks of entities from journalistic reporting☆49Jul 17, 2023Updated 2 years ago
- An alpha project combining beneficial ownership and contracting data☆13Jun 9, 2021Updated 4 years ago
- A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigation…☆15Updated this week
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆65Dec 19, 2025Updated 2 months ago
- A demonstration transnational register of beneficial ownership data from the UK, Denmark, Slovakia and Armenia☆19Oct 30, 2024Updated last year
- Notes for my talk "Exploring the Radio Spectrum for News"☆13Mar 6, 2020Updated 5 years ago
- How can we improve name matching in screening tools?☆15Aug 13, 2025Updated 6 months ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆25Jul 15, 2025Updated 7 months ago
- Tables is a simple command-line tool and powerful library for importing data like a CSV or JSON file into relational tables.☆14Dec 8, 2022Updated 3 years ago
- Machine assisted dossiers☆19Oct 12, 2017Updated 8 years ago
- React/Redux Chartwerk editor.☆10Oct 5, 2018Updated 7 years ago
- US election metadata, packaged as python!☆10Mar 16, 2022Updated 3 years ago
- A Python library for defining rule-based overrides on messy data☆18Nov 24, 2025Updated 3 months ago
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Dec 8, 2022Updated 3 years ago
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.☆123Updated this week
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12May 24, 2023Updated 2 years ago
- A self‑hosted search engine for documents☆710Feb 25, 2026Updated last week
- A collection of lists of forms maintained by local, state and federal policing organizations. If you have a form name, you have a FOIA re…☆18Feb 17, 2026Updated 2 weeks ago
- OCCRP and media partners collected data on COVID-19 related spending from across Europe from February to October 2020☆13Nov 26, 2020Updated 5 years ago
- Encryption for Journalists - Hacks/Hackers NYC☆40Oct 3, 2013Updated 12 years ago
- Frontend interface for Datashare, a self-hosted search engine for documents.☆38Feb 25, 2026Updated last week
- Course materials for SMPA3193, Building Systems for Reporting☆29Apr 25, 2017Updated 8 years ago
- Provide partial dates and retain the date precision through processing☆14Aug 4, 2025Updated 7 months ago
- The Toolkit API, app, and browser extension. Start preserving now.☆48Updated this week
- semantic search for text in your spreadsheets☆56Feb 25, 2026Updated last week
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- Trying to generate name synonyms from wikidata☆35Jun 28, 2020Updated 5 years ago
- An international meta organization to foster news nerd collaboration and knowledge sharing☆112May 17, 2019Updated 6 years ago
- A work-in-progress guide showing how and why you should learn command-line tools (xsv, csvkit) to work with data☆19Mar 16, 2019Updated 6 years ago
- Map locator image generator☆22Sep 30, 2016Updated 9 years ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Jul 6, 2022Updated 3 years ago
- An unambiguous dialect of ArchieML☆23Oct 27, 2023Updated 2 years ago
- Data notification service: subscribe to keywords and get notified whenever an open data sources mentions that keyword.☆24Aug 25, 2013Updated 12 years ago
- Methodology behind story on how poor vaccine coverage in the US greatly increased its exposure to Covid hospitalisations relative to peer…☆26Apr 20, 2022Updated 3 years ago
- Code for Newslynx App☆22Oct 19, 2015Updated 10 years ago
- Automated downloads of geographic information system data posted by the National Oceanic and Atmospheric Administration's National Hurric…☆14Updated this week
- A library for working with the OCOD dataset for analysis of property in England and Wales owned by offshore companies☆13Jan 8, 2026Updated last month
- QubesOS dom0 automation in Python☆12Aug 3, 2017Updated 8 years ago
- A Python helper library to convert between ISO 639 two- and three-letter codes.☆11Nov 13, 2024Updated last year