The SQL/Ibis powered sklearn of record linkage
☆24Mar 25, 2026Updated this week
Alternatives and similar repositories for mismo
Users that are interested in mismo are comparing it to the libraries listed below.
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python.☆19Mar 9, 2026Updated 2 weeks ago
- Fast, accurate, open-source geocoding in Python☆71Feb 17, 2026Updated last month
- ☆52Mar 19, 2026Updated last week
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆135Feb 15, 2026Updated last month
- DuckDB Engine as Google Sheets Library☆20Dec 14, 2024Updated last year
- Distributed Bayesian Entity Resolution in Apache Spark☆59Jun 10, 2021Updated 4 years ago
- Probabilistic Record Linkage Using Pretrained Text Embeddings☆18Feb 23, 2026Updated last month
- ⚠️ Templates of tools to help prevent committing sensitive data to github☆33Jan 6, 2021Updated 5 years ago
- Database backend support for Arquero☆24Oct 31, 2022Updated 3 years ago
- A graph query engine☆23Nov 25, 2025Updated 4 months ago
- Fully unit tested utility functions for data engineering. Python 3 only.☆18Mar 12, 2026Updated 2 weeks ago
- Creating Debian Packages from CRAN Sources☆12Jul 1, 2020Updated 5 years ago
- Social value orientation (SVO) notes for pro-social pro-self concepts☆12Apr 14, 2025Updated 11 months ago
- Repository of web and code editor friendly Observable Data Tools 🛠️ and Notebooks 📚 in .js, .nb.json, .ojs, .omd, .html and .qmd docum…☆34Aug 31, 2023Updated 2 years ago
- Parent repository for the MOJ Analytics Platform☆14Nov 16, 2021Updated 4 years ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆90Updated this week
- Files and instructions for unattended/automatic setup of a Raspberry Pi using only the boot partition which you can see on a flashed SD c…☆14Aug 7, 2021Updated 4 years ago
- An R package to help assess the sensitivity of a Bayesian model (fitted with Stan) to the specification of its likelihood and priors☆11Apr 8, 2025Updated 11 months ago
- Python wrapper for a C++ Double Metaphone☆15Jan 12, 2026Updated 2 months ago
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Jul 9, 2020Updated 5 years ago
- A free offline-first trail map with up-to-date waypoints and comments☆18Apr 5, 2023Updated 2 years ago
- Inspect a URL and estimate if it contains a news story☆39Feb 11, 2026Updated last month
- dbt docs but windows 95☆16Jun 7, 2022Updated 3 years ago
- Python bindings for the USPS Ecommerce APIs☆21Apr 6, 2022Updated 3 years ago
- MS5803-14BA water pressure/depth sensor available from SparkFun Electronics.☆14Jun 12, 2019Updated 6 years ago
- ⌨️ typed.js R htmlwidgets☆15Dec 12, 2021Updated 4 years ago
- ☆10Nov 6, 2025Updated 4 months ago
- Ibis tutorial repository☆36Jul 8, 2024Updated last year
- Miscellaneous code snippets that I want to have versioned.☆17Mar 5, 2023Updated 3 years ago
- Access Amazon's AWS Athena API via reticulate and AWS official Python boto3 module☆10Sep 24, 2018Updated 7 years ago
- Apache DataFusion Benchmarks☆22Mar 3, 2026Updated 3 weeks ago
- ☆11Dec 12, 2025Updated 3 months ago
- Ibis Substrait Compiler☆110Mar 19, 2026Updated last week
- R package for formatting ggplot2 charts and applying MoJ corporate colours.☆17Nov 7, 2024Updated last year
- Efficient BM25 with DuckDB 🦆☆65Dec 20, 2024Updated last year
- Reusable Pattern Matching on Python Objects☆20Oct 9, 2024Updated last year
- DuckDB extension to read files within zip archives.☆58Mar 9, 2026Updated 2 weeks ago
- Probabilistic Entity Matching in Python☆13Apr 5, 2017Updated 8 years ago
- An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms.☆14Mar 11, 2026Updated 2 weeks ago