guardian / giant
Platform for journalists to search, analyse, categorise and share unstructured data
☆54Updated this week
Alternatives and similar repositories for giant:
Users that are interested in giant are comparing it to the libraries listed below
- Extract networks of entities from journalistic reporting☆48Updated last year
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆19Updated last year
- A Python library for defining rule-based overrides on messy data☆13Updated 4 months ago
- Provide partial dates and retain the date precision through processing☆13Updated 2 years ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Updated last year
- A general purpose tool for text-based crosswalking☆105Updated last year
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated last week
- Inspect Element is a practitioner's guide to auditing algorithms and data-driven investigations☆34Updated 3 months ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 3 years ago
- OCCRP and media partners collected data on COVID-19 related spending from across Europe from February to October 2020☆13Updated 4 years ago
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying☆13Updated last year
- A collection of cheat sheets for remembering common commands and tips for data journalism work.☆37Updated last year
- A friendly library for working with PDFs☆11Updated last week
- ☆14Updated last year
- Fraud detection related data and scripts to share with partners.☆23Updated 2 years ago
- A LevelDB backed URL unshortening microservice written in JavaScript☆31Updated 2 years ago
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12Updated last year
- Collaborative data collection tool developed by the Associated Press☆109Updated 2 years ago
- Using Fly.io to generate map tiles☆19Updated last year
- semantic search for your spreadsheets☆25Updated this week
- A build tool for data projects.☆49Updated 3 months ago
- ☆14Updated last month
- 🔎 Finds fuzzy matches between datasets☆12Updated 2 months ago
- Service for creating Twitter datasets for research and archiving.☆26Updated 2 years ago
- Materials to reproduce findings in our stories, "Swinging the Vote?", and "To Gmail, Most Black Lives Matter Emails Are 'Promotions'"☆38Updated 9 months ago
- ReproZip for the Preservation of Web Applications☆17Updated 11 months ago
- Demonstration project for building out a data news rig.☆10Updated 3 years ago
- Basic cookiecutter template for Python projects☆20Updated 6 months ago
- A minimal Akoma Ntoso -based legal informatics toolchain☆14Updated last year
- Data conversions and examples for generating reports from twarc collections using tools such as D3.js☆55Updated 5 years ago