The SQL/Ibis powered sklearn of record linkage
☆24May 11, 2026Updated last month
Alternatives and similar repositories for mismo
Users that are interested in mismo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Extract structured data from free text using large language models☆23Jun 9, 2026Updated last week
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python.☆20Mar 9, 2026Updated 3 months ago
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆139Feb 15, 2026Updated 4 months ago
- DuckDB Engine as Google Sheets Library☆19Dec 14, 2024Updated last year
- Python version of dbtools☆12Jul 30, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- IbisML is a library for building scalable ML pipelines using Ibis.☆118Jul 27, 2025Updated 10 months ago
- Probabilistic Record Linkage Using Pretrained Text Embeddings☆18Apr 15, 2026Updated 2 months ago
- ⚠️ Templates of tools to help prevent committing sensitive data to github☆33Jan 6, 2021Updated 5 years ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆37Dec 3, 2023Updated 2 years ago
- Python implementations of record linkage blocking techniques.☆21Oct 2, 2023Updated 2 years ago
- This repository accompanies Hands On Entity Resolution by O'Reilly☆32Mar 1, 2024Updated 2 years ago
- Database backend support for Arquero☆24Oct 31, 2022Updated 3 years ago
- A graph query engine☆26May 29, 2026Updated 2 weeks ago
- Creating Debian Packages from CRAN Sources☆12Jul 1, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Parent repository for the MOJ Analytics Platform☆14Nov 16, 2021Updated 4 years ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆95Mar 22, 2026Updated 2 months ago
- Python-Markdown plugin for image captions☆12May 24, 2023Updated 3 years ago
- Clustering and Link Prediction Evaluation in R☆15Sep 23, 2023Updated 2 years ago
- ⌨️ typed.js R htmlwidgets☆15Dec 12, 2021Updated 4 years ago
- ☆11Apr 6, 2026Updated 2 months ago
- Ibis tutorial repository☆37Jul 8, 2024Updated last year
- Miscellaneous code snippets that I want to have versioned.☆17Mar 5, 2023Updated 3 years ago
- A local first persistent log☆36Sep 14, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆21Dec 19, 2019Updated 6 years ago
- Apache DataFusion Benchmarks☆23May 2, 2026Updated last month
- ☆12Dec 12, 2025Updated 6 months ago
- ☆10Dec 13, 2014Updated 11 years ago
- Ibis Substrait Compiler☆110Updated this week
- Efficient BM25 with DuckDB 🦆☆69Dec 20, 2024Updated last year
- ☆19Apr 21, 2022Updated 4 years ago
- Probabilistic Entity Matching in Python☆13Apr 5, 2017Updated 9 years ago
- R package for formatting ggplot2 charts and applying MoJ corporate colours.☆11Nov 7, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- DuckDB extension to read files within zip archives.☆64May 24, 2026Updated 3 weeks ago
- Time series forecasting for common inflators and economic indices using the forecast package in R.☆11Feb 28, 2017Updated 9 years ago
- An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms.☆14May 14, 2026Updated last month
- A serverless duckDB deployment at GCP☆41Aug 30, 2022Updated 3 years ago
- Quickly Extract and Marginalize U.S. Census Tables☆18May 13, 2026Updated last month
- Lightweight validation tool for checking function arguments and data analysis scripts.☆12Dec 24, 2024Updated last year
- Jupyter Cell / Line Magics for DuckDB☆59Apr 10, 2026Updated 2 months ago