guardian / giantLinks
Platform for journalists to search, analyse, categorise and share unstructured data
☆55Updated last month
Alternatives and similar repositories for giant
Users that are interested in giant are comparing it to the libraries listed below
Sorting:
- A Python library for defining rule-based overrides on messy data☆14Updated last month
- Provide partial dates and retain the date precision through processing☆13Updated 2 years ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- A general purpose tool for text-based crosswalking☆107Updated last year
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Updated last year
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆23Updated last year
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated this week
- Investigative tool for extracting relevant areas from many documents☆14Updated 9 years ago
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying☆13Updated 2 years ago
- A resource for anyone helping journalists and newsrooms step up their security practices.☆39Updated last year
- How Quartz used AI to help reporters search the Mauritius Leaks☆47Updated 5 years ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆63Updated last month
- A build tool for data projects.☆49Updated 5 months ago
- Materials to reproduce findings in our stories, "Swinging the Vote?", and "To Gmail, Most Black Lives Matter Emails Are 'Promotions'"☆38Updated 11 months ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 3 years ago
- OCCRP and media partners collected data on COVID-19 related spending from across Europe from February to October 2020☆13Updated 4 years ago
- Machine assisted dossiers☆19Updated 7 years ago
- Docker Container for a Make-based, PDF extraction using OCR☆12Updated 10 months ago
- ☆15Updated 2 weeks ago
- A friendly library for working with PDFs☆27Updated this week
- Official repo documenting the closure of Sunlight Labs☆11Updated 8 years ago
- Command-line tool for exploring the PAC donor-recipient relationship☆55Updated 10 years ago
- Data and analysis supporting several passages in the BuzzFeed News article, "The New American Slavery: Invited To The U.S., Foreign Worke…☆28Updated 8 years ago
- semantic search for your spreadsheets☆29Updated this week
- A collection of cheat sheets for remembering common commands and tips for data journalism work.☆37Updated last year
- Inspect Element is a practitioner's guide to auditing algorithms and data-driven investigations☆35Updated last month
- ☆36Updated 2 years ago
- ☆14Updated 9 years ago
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12Updated 2 years ago
- Scrapers for U.S. county court sites.☆67Updated 2 years ago