maxharlow / textmatch
Finds fuzzy matches between datasets
★13, updated last month
Alternatives and similar repositories for textmatch
Users that are interested in textmatch are comparing it to the libraries listed below
- A general purpose tool for text-based crosswalking (★107, updated last year)
- ETL pipeline, graphical explorer and general toolbox for investigations with Follow the Money data (★23, updated last year)
- Scrapers for U.S. county court sites. (★70, updated 2 years ago)
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists. (★17, updated last week)
- Easily download U.S. census maps (★33, updated 2 years ago)
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification. (★24, updated last year)
- Inspect Element is a practitioner's guide to auditing algorithms and data-driven investigations (★36, updated this week)
- List of publicly available, free/open source and open access resources for learning and doing data journalism. (★47, updated last year)
- A friendly library for working with PDFs (★31, updated this week)
- Extract networks of entities from journalistic reporting (★48, updated last year)
- (★14, updated last year)
- Platform for journalists to search, analyse, categorise and share unstructured data (★55, updated last week)
- A tool for telling stories with maps. (★27, updated 9 months ago)
- A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigation… (★13, updated last week)
- A light-weight wrapper for the Datawrapper API. (★63, updated 11 months ago)
- Practical beginner-level introductions to using different tools and technologies, with a focus on their application in the newsroom (★82, updated 2 years ago)
- Doing all sorts of things, the DataMade way (★98, updated 4 months ago)
- (★15, updated last month)
- Svelte components implementing pandoc's JSON AST for describing documents. (★20, updated 6 months ago)
- Docker container for Make-based PDF extraction using OCR (★12, updated 11 months ago)
- Enriches data, adding columns based on lookups to online services (★22, updated last week)
- A tutorial on optical character recognition using tesseract, ImageMagick and other open source tools (★69, updated 5 months ago)
- A reconciliation service for OpenRefine serving data from a given CSV file. (★79, updated 5 months ago)
- How can we improve name matching in screening tools? (★12, updated 5 months ago)
- An extremely fast FEC filing parser written in C (★76, updated 2 months ago)
- Collaborative data collection tool developed by the Associated Press (★109, updated 2 years ago)
- Helper functions for journalism projects. (★24, updated this week)
- ReproZip for the Preservation of Web Applications (★17, updated last year)
- A collection of cheat sheets for remembering common commands and tips for data journalism work. (★38, updated last year)
- An open-source archive that gathers, saves, shares and analyzes news homepages (★139, updated last week)