A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
☆135 · Feb 15, 2026 · Updated last month
Alternatives and similar repositories for linktransformer
Users that are interested in linktransformer are comparing it to the libraries listed below.
- The SQL/Ibis powered sklearn of record linkage ☆24 · Updated this week
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows. ☆90 · Mar 22, 2026 · Updated last week
- An End-to-End Evaluation Framework for Entity Resolution Systems ☆36 · Dec 3, 2023 · Updated 2 years ago
- Repository for in class material for Data Bootcamp ☆13 · May 18, 2019 · Updated 6 years ago
- This repository aims to build a comprehensive literature review of the economics of open source software. Contributions welcome. ☆12 · Apr 2, 2025 · Updated 11 months ago
- ☆15 · Aug 11, 2022 · Updated 3 years ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆59 · Jun 10, 2021 · Updated 4 years ago
- 📰🗞 New York Times data ☆12 · Aug 4, 2018 · Updated 7 years ago
- The repository for PoliPrompt ☆18 · Oct 20, 2024 · Updated last year
- Specification Curve is a Python package that performs specification curve analysis: exploring how a coefficient varies under multiple dif… ☆29 · Updated this week
- Continuous Benchmark of Filtering methods for Entity Resolution ☆11 · Jul 20, 2025 · Updated 8 months ago
- pseudopeople is a Python package that generates realistic simulated data about a fictional United States population, designed for use in … ☆24 · Updated this week
- A tutorial on entity resolution (record linkage or de-duplication) ☆65 · Jun 30, 2020 · Updated 5 years ago
- ☆13 · Jan 10, 2023 · Updated 3 years ago
- This repository contains code and extensive prompt examples to reproduce and extend the experiments in our papers "Using ChatGPT for Enti… ☆65 · Oct 18, 2024 · Updated last year
- ☆18 · Mar 18, 2026 · Updated last week
- UI for JedAI Toolkit ☆17 · May 20, 2022 · Updated 3 years ago
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python. ☆20 · Mar 9, 2026 · Updated 2 weeks ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut… ☆161 · Nov 18, 2022 · Updated 3 years ago
- Tools for ILO Open Data via ILOSTAT bulk download facility or SDMX web service ☆38 · Oct 1, 2025 · Updated 5 months ago
- Causal Inference in Observational Data with Unobserved Heterogeneity (Lecture Notes. Masters/PhD-level) ☆39 · Feb 10, 2026 · Updated last month
- Repository for introductory training materials for overlapping generations modeling ☆13 · Oct 30, 2024 · Updated last year
- A powerful and modular toolkit for record linkage and duplicate detection in Python ☆1,048 · Feb 21, 2024 · Updated 2 years ago
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends ☆2,021 · Updated this week
- Similarity and distance measures for clustering and record linkage applications in R ☆18 · Sep 23, 2025 · Updated 6 months ago
- Income Accounting ☆17 · Feb 11, 2021 · Updated 5 years ago
- Ecological mixed-effects ordination with lme4 ☆12 · May 9, 2016 · Updated 9 years ago
- Demo of a supervised machine learning approach for Entity Resolution in graph using Neo4j GDS Link Prediction Pipelines ☆22 · Apr 11, 2022 · Updated 3 years ago
- Parse Searchable Electoral Rolls ☆11 · Apr 20, 2025 · Updated 11 months ago
- Implements several Markov chain Monte Carlo (MCMC) algorithms for the latent Dirichlet allocation (LDA) model ☆11 · Feb 11, 2020 · Updated 6 years ago
- ☆22 · Jul 15, 2024 · Updated last year
- Coefficient stability plots in R ☆53 · Jan 29, 2021 · Updated 5 years ago
- A Julia package for solving heterogenous-agent economic models using reinforcement learning ☆19 · Jul 28, 2022 · Updated 3 years ago
- Evaluation and benchmarking of PatentsView disambiguation algorithms ☆15 · Jan 18, 2024 · Updated 2 years ago
- scraping and querying documents for LLMs ☆24 · Oct 6, 2025 · Updated 5 months ago
- SAE Unit/area Models and Methods for Estimation in R ☆26 · Feb 24, 2026 · Updated last month
- Quarto extension to implement RevealJS code-focus ☆14 · Jan 28, 2023 · Updated 3 years ago
- A maximum-strength name parser for record linkage. ☆39 · Sep 3, 2025 · Updated 6 months ago
- The source code of the Sudowoodo paper in ICDE 2023 ☆18 · May 24, 2023 · Updated 2 years ago