opensanctions / rigour
Data cleaning and validation functions for names, languages, identifiers, etc.
☆20Updated this week
Alternatives and similar repositories for rigour
Users that are interested in rigour are comparing it to the libraries listed below
Sorting:
- Commons of stupid, simple Python micro functions. Pull requests very welcome.☆19Updated last month
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆62Updated 2 weeks ago
- A Python library for defining rule-based overrides on messy data☆13Updated last month
- Extract networks of entities from journalistic reporting☆48Updated last year
- Write Datasette canned queries as plain SQL files☆13Updated 2 years ago
- Provide partial dates and retain the date precision through processing☆13Updated 2 years ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated 7 months ago
- How can we improve name matching in screening tools?☆12Updated 3 months ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆16Updated this week
- https://mimesniff.spec.whatwg.org/ implementation for Python☆13Updated last year
- An alpha project combining beneficial ownership and contracting data☆13Updated 3 years ago
- Ask questions about government data.☆37Updated 6 years ago
- Save FEC campaign finance data to a SQLite database☆11Updated last year
- Datasette plugin for searching all searchable tables at once☆24Updated 8 months ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆23Updated last year
- Utility library to turn country names into ISO two-letter codes☆66Updated 3 months ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable☆12Updated last year
- Now included in rigour☆151Updated last week
- Datasette plugin adding SQL functions for fuzzy text matching powered by Jellyfish☆12Updated last year
- A markdown wiki and dashboarding system for Datasette☆21Updated 3 years ago
- A helper library full of URL-related heuristics.☆69Updated last month
- 🗞 Monitors data sources, alerts you when they change☆12Updated 3 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- A Python library for dewarping/straightening/reformatting document images and PDFs☆16Updated 2 months ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Updated last year
- Datasette plugin for uploading CSV files and converting them to database tables☆26Updated last year
- Datasette plugin for modifying table schemas☆18Updated 8 months ago
- Tools for running enrichments against data stored in Datasette☆23Updated last week
- OpenSSF Scorecard for top Python packages☆16Updated last week
- Searchable transcripts of the Post Office Horizon IT Inquiry hearings☆11Updated 2 weeks ago