NickCrews / mismoLinks
The SQL/Ibis powered sklearn of record linkage
☆21Updated 3 weeks ago
Alternatives and similar repositories for mismo
Users that are interested in mismo are comparing it to the libraries listed below
Sorting:
- Jupyter Cell / Line Magics for DuckDB☆55Updated 2 months ago
- Fast, accurate, open-source geocoding in Python☆66Updated last week
- A maximum-strength name parser for record linkage.☆39Updated 4 months ago
- A serverless duckDB deployment at GCP☆41Updated 3 years ago
- Linear regression in SQL using dbt☆75Updated last week
- ☆116Updated 2 years ago
- Cross-filter millions (or even billions) of data entries with no interaction delay☆105Updated 2 years ago
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast.☆204Updated 6 months ago
- The Modern Data Stack in a (Smaller) Box☆12Updated 2 years ago
- 📦 Serverless and local-first Open Data Platform☆304Updated 2 weeks ago
- List of entity resolution software and resources.☆103Updated 10 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆107Updated this week
- A playground for running duckdb as a stateless query engine over a data lake☆217Updated last year
- ☆279Updated this week
- Add DuckDB, Parquet, CSV and JSON lines support to Datasette☆56Updated last year
- An experimental Athena extension for DuckDB 🐤☆57Updated last year
- ☆92Updated last year
- CLI to create an ER Diagram from DuckDB database files☆144Updated 9 months ago
- This repository contains CROW, the Clerical Resolution Online Widget, an open-source project designed to help data linkers with their cle…☆10Updated last month
- Python+VueJS application to load, explore, combine,transform and deliver data☆102Updated 10 months ago
- Examples for the MotherDuck WASM Client library, enabling MotherDuck integration for WebAssembly-powered DuckDB☆61Updated last month
- An End-to-End Evaluation Framework for Entity Resolution Systems☆36Updated 2 years ago
- ☆19Updated 2 years ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆118Updated last month
- A DuckDB extension to read data directly from databases supporting the ODBC interface☆86Updated 2 years ago
- Using Mosaic and DuckDB within Observable Framework☆46Updated last year
- A light-weight wrapper for the Datawrapper API.☆86Updated last week
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python.☆17Updated last month
- A simple command line interface to the datamade/dedupe library.☆43Updated 3 years ago
- Use DuckDB within Excel with the xlDuckDb addin☆112Updated last month