opensanctions / fingerprints
Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.
☆145 · Updated this week
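To give a rough idea of what "strong normalisation" of names can mean, here is a minimal sketch using only the Python standard library. It is not the fingerprints library's implementation or API: the function name `simple_fingerprint` and the tiny `LEGAL_FORM_ALIASES` table are hypothetical, chosen only for illustration.

```python
import re
import unicodedata

# Hypothetical, deliberately tiny alias table for legal forms (an assumption;
# a real normaliser would need a much larger, curated list).
LEGAL_FORM_ALIASES = {"limited": "ltd", "incorporated": "inc"}


def simple_fingerprint(name: str) -> str:
    """Return a crude comparison key for a person or company name."""
    # Decompose accented characters, then drop the combining marks.
    decomposed = unicodedata.normalize("NFKD", name)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Case-fold and replace runs of punctuation or whitespace with one space.
    cleaned = re.sub(r"[\W_]+", " ", stripped.casefold())
    # Map known legal forms to a short alias, then sort so word order is ignored.
    tokens = [LEGAL_FORM_ALIASES.get(tok, tok) for tok in cleaned.split()]
    return " ".join(sorted(tokens))


if __name__ == "__main__":
    print(simple_fingerprint("Müller & Söhne, Limited"))
    print(simple_fingerprint("LIMITED Sohne Muller"))
```

Both calls in the usage example produce the same key, so the two spellings can be cross-referenced with a plain equality check. Sorting tokens trades precision for recall: names that differ only in word order collapse to one key, which may or may not be desirable for a given dataset.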
Related projects
Alternatives and complementary repositories for fingerprints
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data. ☆59 · Updated this week
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources ☆200 · Updated this week
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆144 · Updated 10 months ago
- A helper library full of URL-related heuristics. ☆64 · Updated last month
- Data model and processing tools for investigative entity data ☆219 · Updated this week
- API client for Aleph, supports bulk entity and document upload. ☆28 · Updated last month
- Trying to generate name synonyms from wikidata ☆33 · Updated 4 years ago
- Extract networks of entities from journalistic reporting ☆47 · Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆42 · Updated 5 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 · Updated last year
- An alpha project combining beneficial ownership and contracting data ☆13 · Updated 3 years ago
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec. ☆71 · Updated this week
- A browser user interface for manual labeling of record pairs. ☆41 · Updated last year
- Parser and standardizer for politician, individual and organization names. ☆128 · Updated 7 years ago
- Dedupe/batch geocode addresses and venues around the world with libpostal ☆82 · Updated 2 years ago
- Execute OpenRefine JSON scripts without OpenRefine (or Java) ☆29 · Updated last year
- Utility library to turn country names into ISO two-letter codes ☆66 · Updated this week
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆56 · Updated 9 months ago
- An index data structure for approximate string search. ☆23 · Updated 5 years ago
- A maximum-strength name parser for record linkage. ☆34 · Updated 3 months ago
- Social Feed Manager user interface application. ☆153 · Updated 4 months ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency. ☆187 · Updated 2 years ago
- ⚡️ Enriches data, adding columns based on lookups to online services ☆23 · Updated 3 weeks ago
- Tag news stories based on models trained on the NYT corpus. ☆40 · Updated last year
- Lightweight web scraping toolkit for documents and structured data. ☆310 · Updated 10 months ago
- Parse Popolo JSON data and navigate it with Python ☆15 · Updated 4 years ago
- Resources for tackling record linkage / deduplication / data matching problems ☆112 · Updated 9 months ago
- Abydos NLP/IR library for Python ☆183 · Updated 2 years ago
- Scalable String Similarity Joins in Python ☆39 · Updated 4 months ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data. ☆13 · Updated last year