NickCrews / mismo
The SQL/Ibis powered sklearn of record linkage
☆13 Updated last week
Alternatives and similar repositories for mismo:
Users who are interested in mismo are comparing it to the libraries listed below.
- Jupyter Cell / Line Magics for DuckDB ☆45 Updated last week
- Linear regression in SQL using dbt ☆68 Updated last month
- A Singer.io target for DuckDB ☆17 Updated 4 months ago
- Ibis analytics, with Ibis (and more!) ☆20 Updated 4 months ago
- An experimental Athena extension for DuckDB 🐤 ☆53 Updated last month
- ☆10 Updated 3 years ago
- Use DuckDB within Excel with the xlDuckDb addin ☆70 Updated 2 months ago
- List of entity resolution software and resources. ☆56 Updated 11 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily ☆92 Updated this week
- A maximum-strength name parser for record linkage. ☆36 Updated last week
- A serverless duckDB deployment at GCP ☆38 Updated 2 years ago
- Examples for the MotherDuck WASM Client library, enabling MotherDuck integration for WebAssembly-powered DuckDB ☆50 Updated last week
- A DuckDB extension to read data directly from databases supporting the ODBC interface ☆82 Updated last year
- Repo for orienting dbt users to the Dagster asset framework ☆53 Updated 2 years ago
- ☆86 Updated 9 months ago
- ☆16 Updated last year
- An End-to-End Evaluation Framework for Entity Resolution Systems ☆26 Updated last year
- ☆53 Updated 7 months ago
- A SQLite adapter plugin for dbt (data build tool) ☆77 Updated 3 weeks ago
- fst: flow state tool | smooth where you want it, friction where you need it when data engineering ☆34 Updated last year
- asyncio bridge to the duckdb library ☆36 Updated last year
- A repository of runnable examples using ibis ☆42 Updated 7 months ago
- quadipy is a python package to help transform structured data into RDF graph format ☆19 Updated last year
- pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other do… ☆10 Updated last year
- 🏁 A sweet and speedy code generator for dbt 🏎️✨ ☆25 Updated 8 months ago
- A light-weight wrapper for the Datawrapper API. ☆62 Updated 7 months ago
- Using DuckDB with AWS Lambda to process Delta Lake data ☆20 Updated 3 weeks ago
- This repository contains CROW, the Clerical Resolution Online Widget, an open-source project designed to help data linkers with their cle… ☆10 Updated 3 months ago
- ☆82 Updated last year