maxharlow / textmatch
🔎 Finds fuzzy matches between datasets
☆12Updated last month
Alternatives and similar repositories for textmatch:
Users that are interested in textmatch are comparing it to the libraries listed below
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆15Updated last year
- ☆14Updated 3 weeks ago
- A collaborative collection of datasets that are common to use within "Follow the Money" investigations with european scope☆13Updated 9 months ago
- A step-by-step guide to publishing a standalone story from a dataset.☆30Updated 2 weeks ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated 3 weeks ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆20Updated 3 weeks ago
- Docker Container for a Make-based, PDF extraction using OCR☆12Updated 7 months ago
- ☆14Updated last year
- Extract networks of entities from journalistic reporting☆48Updated last year
- Workbook to teach the concept of risk ratios for data journalism applications☆31Updated 2 years ago
- A general purpose tool for text-based crosswalking☆105Updated last year
- Easily download U.S. census maps☆33Updated 2 years ago
- ☆11Updated 2 weeks ago
- Fraud detection related data and scripts to share with partners.☆23Updated 2 years ago
- A tool for telling stories with maps.☆27Updated 6 months ago
- semantic search for your spreadsheets☆24Updated this week
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on scraping web data using Python.☆22Updated last year
- Module on both the MA Data Journalism and MA Multiplatform and Mobile Journalism at Birmingham City University☆28Updated last month
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Updated last year
- How can we improve name matching in screening tools?☆12Updated 2 months ago
- ☆12Updated 2 years ago
- Project generator for use with the datakit framework.☆28Updated last year
- A light-weight wrapper for the Datawrapper API.☆63Updated 8 months ago
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying☆13Updated last year
- yet another foia automation service☆43Updated 2 years ago
- A collection of cheat sheets for remembering common commands and tips for data journalism work.☆37Updated last year
- A tutorial on optical character recognition using tesseract, ImageMagick and other open source tools☆69Updated last month
- Data Journalism training materials☆17Updated 3 years ago
- OCCRP and media partners collected data on COVID-19 related spending from across Europe from February to October 2020☆13Updated 4 years ago
- Analysis and code to accompany The Companies We Keep briefing on the state of the UK's register of Persons of Significant Control.☆19Updated 6 years ago