NickCrews / mismo
The SQL/Ibis powered sklearn of record linkage
☆13Updated this week
Alternatives and similar repositories for mismo:
Users that are interested in mismo are comparing it to the libraries listed below
- Jupyter Cell / Line Magics for DuckDB☆46Updated this week
- List of entity resolution software and resources.☆53Updated 10 months ago
- An experimental Athena extension for DuckDB 🐤☆53Updated last month
- An End-to-End Evaluation Framework for Entity Resolution Systems☆26Updated last year
- A maximum-strength name parser for record linkage.☆36Updated 5 months ago
- Linear regression in SQL using dbt☆69Updated 2 weeks ago
- ☆10Updated 3 years ago
- Sentiment and language detection for text analytics.☆16Updated 6 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆88Updated this week
- asyncio bridge to the duckdb library☆35Updated last year
- pseudopeople is a Python package that generates realistic simulated data about a fictional United States population, designed for use in …☆20Updated this week
- Ibis analytics, with Ibis (and more!)☆20Updated 4 months ago
- ☆14Updated last year
- A SQLite adapter plugin for dbt (data build tool)☆77Updated last week
- A repository of runnable examples using ibis☆42Updated 6 months ago
- ☆85Updated 8 months ago
- Interactive notebooks containing demonstration code of the splink library☆37Updated last year
- A webring for data people who write☆26Updated last month
- 🌎 Polars H3 Geospatial Plugin☆44Updated this week
- Write your dbt models using Ibis☆58Updated 3 weeks ago
- Examples for the MotherDuck WASM Client library, enabling MotherDuck integration for WebAssembly-powered DuckDB☆44Updated last month
- ☆59Updated 2 months ago
- Read Apache Arrow batches from ODBC data sources in Python☆61Updated this week
- NormConf Goodies API☆22Updated 2 years ago
- ☆16Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆96Updated last month
- The Modern Data Stack in a Python package☆49Updated last year
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆64Updated last month
- A library to convert a pydantic model to a pyarrow schema☆23Updated 2 months ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S…☆13Updated 6 months ago