Platform for journalists to search, analyse, categorise and share unstructured data
☆59Apr 27, 2026Updated last week
Alternatives and similar repositories for giant
Users that are interested in giant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An alpha project combining beneficial ownership and contracting data☆13Jun 9, 2021Updated 4 years ago
- Extract networks of entities from journalistic reporting☆49Jul 17, 2023Updated 2 years ago
- A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigation…☆15Apr 14, 2026Updated 2 weeks ago
- A demonstration transnational register of beneficial ownership data from the UK, Denmark, Slovakia and Armenia☆19Oct 30, 2024Updated last year
- How can we improve name matching in screening tools?☆16Aug 13, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Notes for my talk "Exploring the Radio Spectrum for News"☆13Mar 6, 2020Updated 6 years ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆65Dec 19, 2025Updated 4 months ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆25Jul 15, 2025Updated 9 months ago
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.☆130Updated this week
- International Address formatter which considers the standard formatting rules of the country☆14Nov 21, 2024Updated last year
- The Toolkit API, app, and browser extension. Start preserving now.☆50Apr 6, 2026Updated 3 weeks ago
- Machine assisted dossiers☆19Oct 12, 2017Updated 8 years ago
- A Python library for defining rule-based overrides on messy data☆18Nov 24, 2025Updated 5 months ago
- Trying to generate name synonyms from wikidata☆34Jun 28, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Frontend interface for Datashare, a self-hosted search engine for documents.☆38Updated this week
- Encryption for Journalists - Hacks/Hackers NYC☆40Oct 3, 2013Updated 12 years ago
- Provide partial dates and retain the date precision through processing☆14Aug 4, 2025Updated 9 months ago
- Tables is a simple command-line tool and powerful library for importing data like a CSV or JSON file into relational tables.☆14Mar 23, 2026Updated last month
- A re-useable, stand-alone version of LittleSis network storytelling tool☆12Jan 30, 2016Updated 10 years ago
- A self‑hosted search engine for documents☆730Apr 27, 2026Updated last week
- US election metadata, packaged as python!☆10Mar 16, 2022Updated 4 years ago
- React/Redux Chartwerk editor.☆10Oct 5, 2018Updated 7 years ago
- Data notification service: subscribe to keywords and get notified whenever an open data sources mentions that keyword.☆24Aug 25, 2013Updated 12 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Jul 6, 2022Updated 3 years ago
- OCCRP and media partners collected data on COVID-19 related spending from across Europe from February to October 2020☆13Nov 26, 2020Updated 5 years ago
- ☆14Sep 11, 2019Updated 6 years ago
- An international meta organization to foster news nerd collaboration and knowledge sharing☆112May 17, 2019Updated 6 years ago
- Ask questions about government data.☆38Jan 17, 2019Updated 7 years ago
- A collection of lists of forms maintained by local, state and federal policing organizations. If you have a form name, you have a FOIA re…☆18Updated this week
- Course materials for SMPA3193, Building Systems for Reporting☆29Apr 25, 2017Updated 9 years ago
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12May 24, 2023Updated 2 years ago
- A Go port of FollowTheMoney (FtM) — a pragmatic data model for people, companies, assets, relationships and documents used in investigati…☆21Sep 8, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Python helper library to convert between ISO 639 two- and three-letter codes.☆11Nov 13, 2024Updated last year
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Dec 8, 2022Updated 3 years ago
- A library for working with the OCOD dataset for analysis of property in England and Wales owned by offshore companies☆13Apr 14, 2026Updated 2 weeks ago
- A work-in-progress guide showing how and why you should learn command-line tools (xsv, csvkit) to work with data☆19Mar 16, 2019Updated 7 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15May 2, 2015Updated 11 years ago
- QubesOS dom0 automation in Python☆13Aug 3, 2017Updated 8 years ago