guardian / giant
Platform for journalists to search, analyse, categorise and share unstructured data
☆55Updated 2 weeks ago
Alternatives and similar repositories for giant:
Users that are interested in giant are comparing it to the libraries listed below
- A Python library for defining rule-based overrides on messy data☆13Updated 2 weeks ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆23Updated last year
- Provide partial dates and retain the date precision through processing☆13Updated 2 years ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Updated last year
- A collaborative collection of datasets that are common to use within "Follow the Money" investigations with european scope☆13Updated 11 months ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated last month
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12Updated last year
- A friendly library for working with PDFs☆14Updated this week
- How Quartz used AI to help reporters search the Mauritius Leaks☆47Updated 5 years ago
- ☆14Updated last year
- A resource for anyone helping journalists and newsrooms step up their security practices.☆39Updated last year
- A general purpose tool for text-based crosswalking☆106Updated last year
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying☆13Updated last year
- Materials to reproduce findings in our stories, "Swinging the Vote?", and "To Gmail, Most Black Lives Matter Emails Are 'Promotions'"☆38Updated 10 months ago
- OCCRP and media partners collected data on COVID-19 related spending from across Europe from February to October 2020☆13Updated 4 years ago
- Pull out versions of specific files from a gitscraping repo into individual files.☆15Updated 3 years ago
- a general list of resources and articles for people interested in getting into data journalism☆16Updated 2 years ago
- Collaborative data collection tool developed by the Associated Press☆109Updated 2 years ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆62Updated this week
- Machine assisted dossiers☆19Updated 7 years ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 3 years ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago
- Twitter, quick. Fetch and store tweets on short notice.☆80Updated 8 years ago
- A step-by-step guide to publishing a standalone story from a dataset.☆30Updated last month
- ☆14Updated last month
- A build tool for data projects.☆49Updated 4 months ago
- Official repo documenting the closure of Sunlight Labs☆11Updated 8 years ago
- Core library for the datakit CLI framework.☆55Updated 2 years ago
- A collection of cheat sheets for remembering common commands and tips for data journalism work.☆37Updated last year