PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
☆161Nov 18, 2022Updated 3 years ago
Alternatives and similar repositories for entity-embed
Users that are interested in entity-embed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A tool to generate fixed-width CNAB240 files to perform bulk payments☆21Jul 1, 2022Updated 4 years ago
- Vinta's ESLint and Prettier shareable configs.☆23Feb 19, 2024Updated 2 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Dec 20, 2021Updated 4 years ago
- GPTBundle, a React application toolkit, harnesses AI to convert textual content into structured forms and delivers advanced autofill sugg…☆22Mar 27, 2024Updated 2 years ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆24May 31, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Distributed Bayesian Entity Resolution in Apache Spark☆60Jun 10, 2021Updated 5 years ago
- An online jukebox with all the songs from Deezer and YouTube. Built with Django and Angular.☆22Apr 11, 2016Updated 10 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆30Apr 5, 2023Updated 3 years ago
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆312Apr 17, 2024Updated 2 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆22Oct 18, 2021Updated 4 years ago
- The engine behind Vinta's Lessons Learned page.☆38Dec 26, 2022Updated 3 years ago
- Open-source free TypeScript library to implement SMART Health Cards and Links☆26May 1, 2026Updated 2 months ago
- ☆19Sep 23, 2024Updated last year
- A simple command line interface to the datamade/dedupe library.☆43Dec 26, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Django 2.1 project to reproduce WebKit Bug 188165 and Django Ticket #30250☆15Mar 29, 2019Updated 7 years ago
- ☆15Aug 11, 2022Updated 3 years ago
- Fuzzy Categorical Distances☆14Mar 31, 2020Updated 6 years ago
- Facebook GraphAPI wrapper using tapioca☆28Jun 1, 2016Updated 10 years ago
- Scrobble your last.fm or Spotify activity to the Gather status.☆15Jul 27, 2024Updated last year
- Twitter API wrapper using tapioca☆16Dec 5, 2017Updated 8 years ago
- Python package for performing Entity and Text Matching using Deep Learning.☆620Jun 18, 2024Updated 2 years ago
- Efficient String Comparison Functions and Fuzzy String Matching☆20Sep 21, 2025Updated 9 months ago
- List of entity resolution software and resources.☆130Mar 24, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,055Feb 21, 2024Updated 2 years ago
- Scalable master data management, identity resolution, entity resolution, and deduplication using ML☆1,226Updated this week
- A browser user interface for manual labeling of record pairs.☆48Jun 23, 2023Updated 3 years ago
- Useful checklist for dealing with recovery crisis. Based on the talk "Saving Great Projects" 2017 Python Brasil☆18Dec 8, 2018Updated 7 years ago
- The relevant React Events Library.☆21Jan 6, 2023Updated 3 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆127Feb 22, 2024Updated 2 years ago
- LEMON: Explainable Entity Matching☆19Apr 6, 2022Updated 4 years ago
- An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms.☆14Jun 14, 2026Updated 2 weeks ago
- Dedupe/batch geocode addresses and venues around the world with libpostal☆84Nov 29, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends☆2,225Jun 25, 2026Updated last week
- Entity resolution using zero labeled examples☆33Jun 29, 2024Updated 2 years ago
- Python API client generator☆349Jun 6, 2023Updated 3 years ago
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python.☆20Mar 9, 2026Updated 3 months ago
- The SQL/Ibis powered sklearn of record linkage☆24Jun 12, 2026Updated 2 weeks ago
- Vinta's Best Moves Compiled☆229Nov 22, 2023Updated 2 years ago
- ☆192May 29, 2024Updated 2 years ago