opensanctions / rigour
Data cleaning and validation functions for names, languages, identifiers, etc.
☆17 Updated last month
Alternatives and similar repositories for rigour:
Users interested in rigour compare it to the libraries listed below.
- Commons of stupid, simple Python micro functions. Pull requests very welcome. ☆19 Updated 2 years ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data. ☆59 Updated last week
- A Python library for defining rule-based overrides on messy data ☆13 Updated 3 months ago
- An alpha project combining beneficial ownership and contracting data ☆13 Updated 3 years ago
- API client for Aleph, supports bulk entity and document upload. ☆28 Updated 4 months ago
- https://mimesniff.spec.whatwg.org/ implementation for Python ☆14 Updated last year
- Provide partial dates and retain the date precision through processing ☆13 Updated 2 years ago
- Loading OpenSanctions into Neo4j and Linkurious ☆28 Updated 2 months ago
- How can we improve name matching in screening tools? ☆12 Updated last month
- ETL pipeline, graphical explorer and general toolbox for investigations with follow the money data ☆15 Updated last year
- (Archived) A Python library for record linkage and deduplication. ☆19 Updated 11 months ago
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec. ☆79 Updated this week
- Utility library to turn country names into ISO two-letter codes ☆66 Updated 2 weeks ago
- A helper library full of URL-related heuristics. ☆66 Updated 4 months ago
- CSV on the web ☆38 Updated last week
- International address formatter that applies each country's standard formatting rules ☆26 Updated 3 years ago
- Sort-friendly URI Reordering Transform (SURT) Python module ☆41 Updated 7 months ago
- A markdown wiki and dashboarding system for Datasette ☆21 Updated 3 years ago
- Simple tool to pull posts and users from Gab ☆16 Updated 2 weeks ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models ☆15 Updated this week
- Trough: Big data, small databases. ☆40 Updated 7 months ago
- Trying to generate name synonyms from Wikidata ☆32 Updated 4 years ago
- Build requirements files from setup.py. ☆27 Updated 2 years ago
- jq module to process Wikidata JSON format ☆11 Updated 5 years ago
- Python-based Wikidata framework for easy dataframe extraction ☆42 Updated last year
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆148 Updated last month
- This collaborative resource aims at empowering all actors countering information manipulation to grow and improve. ☆15 Updated last year
- Backports for ckan.plugins.toolkit to ease CKAN extension compatibility ☆17 Updated 2 years ago
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources ☆204 Updated last week
- Datasette plugin for authenticating access using API tokens ☆11 Updated 6 months ago