PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
☆161Nov 18, 2022Updated 3 years ago
Alternatives and similar repositories for entity-embed
Users that are interested in entity-embed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A tool to generate fixed-width CNAB240 files to perform bulk payments☆21Jul 1, 2022Updated 3 years ago
- Vinta's ESLint and Prettier shareable configs.☆23Feb 19, 2024Updated 2 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Dec 20, 2021Updated 4 years ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆24May 31, 2022Updated 3 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆60Jun 10, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- FHIR-native live chat mobile app built with React Native and Medplum☆23Feb 17, 2025Updated last year
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆30Apr 5, 2023Updated 3 years ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b…☆26Apr 14, 2023Updated 3 years ago
- An open source, high scalability toolkit in Java for Entity Resolution.☆224Jul 12, 2025Updated 9 months ago
- A list of free data matching and record linkage software.☆403Feb 21, 2024Updated 2 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆21Oct 18, 2021Updated 4 years ago
- The engine behind Vinta's Lessons Learned page.☆38Dec 26, 2022Updated 3 years ago
- Open-source free TypeScript library to implement SMART Health Cards and Links☆25Apr 1, 2026Updated last month
- A Django 2.1 project to reproduce WebKit Bug 188165 and Django Ticket #30250☆15Mar 29, 2019Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Record Linkage ToolKit (Find and link entities)☆111Aug 14, 2023Updated 2 years ago
- Fuzzy Categorical Distances☆14Mar 31, 2020Updated 6 years ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆92Mar 22, 2026Updated last month
- Easy Django integration with Elasticsearch through ZomboDB Postgres Extension☆148Dec 28, 2022Updated 3 years ago
- Efficient String Comparison Functions and Fuzzy String Matching☆20Sep 21, 2025Updated 7 months ago
- List of entity resolution software and resources.☆117Mar 24, 2026Updated last month
- Improve performance and maintainability with a prefetching layer in your Django project☆156Jun 23, 2025Updated 10 months ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,048Feb 21, 2024Updated 2 years ago
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML☆1,188Apr 24, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,455Jul 29, 2025Updated 9 months ago
- Portal do Grupo de Usuários Python de Pernambuco☆16Mar 10, 2012Updated 14 years ago
- Useful checklist for building great Celery tasks.☆121Oct 7, 2019Updated 6 years ago
- The relevant React Events Library.☆21Jan 6, 2023Updated 3 years ago
- Integrate AI Assistants with Django to build intelligent applications☆409Mar 23, 2026Updated last month
- A tutorial on entity resolution (record linkage or de-duplication)☆65Jun 30, 2020Updated 5 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆127Feb 22, 2024Updated 2 years ago
- LEMON: Explainable Entity Matching☆19Apr 6, 2022Updated 4 years ago
- An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms.☆14Apr 9, 2026Updated 3 weeks ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends☆2,111Updated this week
- Python API client generator☆350Jun 6, 2023Updated 2 years ago
- Vinta's Best Moves Compiled☆228Nov 22, 2023Updated 2 years ago
- ☆193May 29, 2024Updated last year
- A collection of Django security-related tools and libs.☆219Feb 24, 2025Updated last year
- my sketches created with processing☆45Nov 16, 2025Updated 5 months ago
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Jul 9, 2020Updated 5 years ago