A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
☆139Feb 15, 2026Updated 2 months ago
Alternatives and similar repositories for linktransformer
Users that are interested in linktransformer are comparing it to the libraries listed below.
- The SQL/Ibis powered sklearn of record linkage☆23Apr 19, 2026Updated 2 weeks ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆94Mar 22, 2026Updated last month
- An End-to-End Evaluation Framework for Entity Resolution Systems☆36Dec 3, 2023Updated 2 years ago
- Probabilistic Record Linkage Using Pretrained Text Embeddings☆18Apr 15, 2026Updated 3 weeks ago
- Repository for in class material for Data Bootcamp☆14May 18, 2019Updated 6 years ago
- ☆15Aug 11, 2022Updated 3 years ago
- 📰🗞 New York Times data☆12Aug 4, 2018Updated 7 years ago
- Specification Curve is a Python package that performs specification curve analysis: exploring how a coefficient varies under multiple dif…☆29Updated this week
- blackmaRble: retrieve, wrangle and plot VIIRS Black Marble nighttimelight data in R☆18Dec 21, 2023Updated 2 years ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- ☆11Apr 2, 2021Updated 5 years ago
- An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms.☆14Apr 9, 2026Updated last month
- A tutorial on entity resolution (record linkage or de-duplication)☆65Jun 30, 2020Updated 5 years ago
- This repository contains code and extensive prompt examples to reproduce and extend the experiments in our papers "Using ChatGPT for Enti…☆66Oct 18, 2024Updated last year
- ☆18Apr 27, 2026Updated last week
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python.☆20Mar 9, 2026Updated 2 months ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆161Nov 18, 2022Updated 3 years ago
- ☆19Jul 22, 2023Updated 2 years ago
- A Tool for the Congress Data dataset☆26Dec 8, 2025Updated 5 months ago
- Tools for ILO Open Data via ILOSTAT bulk download facility or SDMX web service☆38Apr 24, 2026Updated 2 weeks ago
- Causal Inference in Observational Data with Unobserved Heterogeneity (Lecture Notes. Masters/PhD-level)☆40Feb 10, 2026Updated 2 months ago
- This is an R wrapper for the APIs on government of India's open data platform - data.gov.in.☆18Sep 22, 2024Updated last year
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,049Feb 21, 2024Updated 2 years ago
- ☆19Jan 4, 2024Updated 2 years ago
- Income Accounting☆17Feb 11, 2021Updated 5 years ago
- Demo of a supervised machine learning approach for Entity Resolution in graph using Neo4j GDS Link Prediction Pipelines☆22Apr 11, 2022Updated 4 years ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b…☆26Apr 14, 2023Updated 3 years ago
- Graduate Environment & Development Economics at the University of Minnesota☆21Jan 21, 2025Updated last year
- Parse Searchable Electoral Rolls☆12Apr 20, 2025Updated last year
- Implements several Markov chain Monte Carlo (MCMC) algorithms for the latent Dirichlet allocation (LDA) model☆11Feb 11, 2020Updated 6 years ago
- Named Entity Recognition with the Nametag Maximum Entropy Markov model☆12Feb 9, 2026Updated 3 months ago
- Coefficient stability plots in R☆53Jan 29, 2021Updated 5 years ago
- Evaluation and benchmarking of PatentsView disambiguation algorithms☆16Jan 18, 2024Updated 2 years ago
- A Julia package for solving heterogenous-agent economic models using reinforcement learning☆19Jul 28, 2022Updated 3 years ago
- Quarto extension to implement RevealJS code-focus☆15Jan 28, 2023Updated 3 years ago
- ☆60Apr 29, 2026Updated last week
- The source code of the Sudowoodo paper in ICDE 2023☆19May 24, 2023Updated 2 years ago
- R package fastLink: Fast Probabilistic Record Linkage☆291Feb 28, 2026Updated 2 months ago
- ☆33Aug 30, 2023Updated 2 years ago