dataresearchcenter / datasetsLinks
A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigations.
☆13Updated this week
Alternatives and similar repositories for datasets
Users that are interested in datasets are comparing it to the libraries listed below
Sorting:
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆23Updated last year
- OCCRP and media partners collected data on COVID-19 related spending from across Europe from February to October 2020☆13Updated 4 years ago
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12Updated 2 years ago
- 🔎 Finds fuzzy matches between datasets☆13Updated this week
- Extract networks of entities from journalistic reporting☆48Updated last year
- A Python library for defining rule-based overrides on messy data☆14Updated last month
- A tool for telling stories with maps.☆27Updated 8 months ago
- 🗞 Monitors data sources, alerts you when they change☆12Updated 3 years ago
- Easily download U.S. census maps☆33Updated 2 years ago
- Frontend interface for Datashare, a self-hosted search engine for documents.☆35Updated this week
- ⚡️ Enriches data, adding columns based on lookups to online services☆22Updated last week
- A step-by-step guide to publishing a standalone story from a dataset.☆30Updated 2 months ago
- GIS data for the U.S.-Mexico border fence (perhaps a wall in the future)☆28Updated 7 years ago
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying☆13Updated 2 years ago
- How Quartz used AI to help reporters search the Mauritius Leaks☆47Updated 5 years ago
- semantic search for your spreadsheets☆29Updated this week
- Workbook to teach the concept of risk ratios for data journalism applications☆33Updated 3 years ago
- Inspect Element is a practitioner's guide to auditing algorithms and data-driven investigations☆35Updated last month
- Python parser for the Archie Markup Language (ArchieML)☆12Updated 3 years ago
- Docker Container for a Make-based, PDF extraction using OCR☆12Updated 10 months ago
- Machine assisted dossiers☆19Updated 7 years ago
- POLITICO's system for managing civic data☆20Updated 2 years ago
- Collaborative data collection tool developed by the Associated Press☆109Updated 2 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆24Updated 4 years ago
- A demonstration of how to build and publish pages with the baker build tool☆20Updated 9 months ago
- ☆23Updated 9 years ago
- DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org☆39Updated 2 weeks ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆20Updated 3 months ago
- Platform for journalists to search, analyse, categorise and share unstructured data☆55Updated last month
- Materials to reproduce findings in our stories, "Swinging the Vote?", and "To Gmail, Most Black Lives Matter Emails Are 'Promotions'"☆38Updated 11 months ago