Entity resolution, also known as data matching or record linkage, is the task of finding records that refer to the same or a similar real-world entity, whether those records sit in the same data set or in different ones. Record linkage is necessary when joining entities that are similar and may or may not share common identifiers. …
☆32 · Apr 8, 2025 · Updated 10 months ago
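To make the description above concrete, here is a minimal sketch of record linkage between two small data sets that share no common identifier. It is not taken from the entity-resolution repository: the field names (`name`, `city`), the sample records, the helper names, and the `0.85` threshold are all assumptions chosen for illustration, and the string comparison uses Python's standard `difflib.SequenceMatcher` rather than any particular matching library.

```python
from difflib import SequenceMatcher


def normalize(value: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not hurt the score."""
    return " ".join(value.lower().split())


def similarity(a: dict, b: dict, fields=("name", "city")) -> float:
    """Average string similarity over the compared fields (0.0 to 1.0)."""
    scores = [
        SequenceMatcher(None, normalize(a[f]), normalize(b[f])).ratio()
        for f in fields
    ]
    return sum(scores) / len(scores)


def link_records(left, right, threshold=0.85):
    """Return (left_id, right_id, score) for every pair scoring at or above the threshold."""
    matches = []
    for l_rec in left:
        for r_rec in right:
            score = similarity(l_rec, r_rec)
            if score >= threshold:
                matches.append((l_rec["id"], r_rec["id"], round(score, 3)))
    return matches


# Hypothetical sample data: the two sets have no shared key.
customers = [
    {"id": "c1", "name": "Jon Smith", "city": "New York"},
    {"id": "c2", "name": "Maria Garcia", "city": "Madrid"},
]
orders = [
    {"id": "o1", "name": "John Smith", "city": "new york"},
    {"id": "o2", "name": "Wei Chen", "city": "Shanghai"},
]

matches = link_records(customers, orders)
print(matches)
```

With this sample data, only the `c1`/`o1` pair clears the threshold. Comparing every pair is quadratic in the number of records, so a real pipeline would usually add a blocking or indexing step before scoring; that step is omitted here to keep the sketch short.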
Alternatives and similar repositories for entity-resolution
Users that are interested in entity-resolution are comparing it to the libraries listed below
- Using modal.com to process FineWeb-edu data ☆20 · Apr 5, 2025 · Updated 10 months ago
- Python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high-dimensional data ☆19 · Aug 15, 2024 · Updated last year
- ☆12 · Jan 17, 2026 · Updated last month
- Knowledge graph extraction from text using OpenAI ChatGPT for graph extraction and Neo4j for DB storage ☆11 · Feb 26, 2024 · Updated 2 years ago
- ☆10 · May 25, 2021 · Updated 4 years ago
- A Python Reddit scraper with dual-mode architecture: simple requests for small jobs, async + proxy rotation for large-scale scraping. Fea… ☆16 · Oct 30, 2025 · Updated 4 months ago
- Minimalist library for LLM usage ☆13 · Sep 7, 2025 · Updated 5 months ago
- Automated Continuous Data Quality Measurement ☆12 · Nov 15, 2023 · Updated 2 years ago
- Architecture for the Twint scraper that allows downloading tweets on many instances without API restrictions ☆10 · Nov 30, 2020 · Updated 5 years ago
- Python library for the simulation of probabilistic circuits. ☆11 · Feb 1, 2026 · Updated last month
- Framework for studying cryptographic hash functions using SAT. ☆10 · Dec 21, 2021 · Updated 4 years ago
- Causality in Knowledge Graphs ☆11 · Oct 12, 2022 · Updated 3 years ago
- An AI-powered web application leveraging Next.js 14 and TensorFlow.js for real-time object detection. Utilizing a TensorFlow model for accu… ☆12 · Dec 3, 2024 · Updated last year
- Scrape the most-mentioned stock tickers from Reddit (Wallstreetbets and Wallstreetbetsnew) ☆12 · Mar 5, 2021 · Updated 4 years ago
- CSC 424 Advanced Database Management Systems ☆16 · Jan 1, 2020 · Updated 6 years ago
- A single source of truth for data definitions ☆11 · Dec 10, 2022 · Updated 3 years ago
- Data pipelines for AI applications ☆12 · Feb 2, 2026 · Updated last month
- Twitter-based sentiment analysis using Java and Hadoop. In this project we are doing sentiment analysis on Twitter data to analyse wh… ☆10 · Apr 22, 2018 · Updated 7 years ago
- A fast TUI application (with optional webui) to visually navigate and inspect JSON and JSONL data. Easily localize parse errors in large … ☆15 · Sep 30, 2024 · Updated last year
- This program meshes volumetric video recorded with LiveScan3D ☆10 · Dec 17, 2020 · Updated 5 years ago
- Bayesian probability transforms for BM25 retrieval scores ☆40 · Updated this week
- PyTorch Implementation of Context-Aware Sequential Model for Multi-Behaviour Recommendation https://arxiv.org/abs/2312.09684 ☆10 · May 31, 2024 · Updated last year
- Various Vector Similarity Search examples ☆13 · Dec 30, 2022 · Updated 3 years ago
- Scrapes article content from the web, given either a single URL or a text file containing a set of URLs. This project… ☆17 · Aug 5, 2020 · Updated 5 years ago
- Canadian threat feeds updated every 12 hours. ☆20 · Updated this week
- The official Python library for the Writer API ☆11 · Updated this week
- A Python package for accessing the OpenCorporates API ☆11 · Feb 12, 2019 · Updated 7 years ago
- The Dynamic Rules Engine is a serverless application that enables real-time evaluation of rules against sensor data, leveraging AWS Kines… ☆11 · Sep 25, 2024 · Updated last year
- Repo for the Nuxt 3 Fundamentals Course ☆11 · Aug 27, 2023 · Updated 2 years ago
- ApertureDB Python Client ☆12 · Jan 14, 2026 · Updated last month
- Examples for using the Pipl SEARCH API ☆11 · Dec 19, 2023 · Updated 2 years ago
- A sample set of notebooks demonstrating Amazon Comprehend capabilities. ☆46 · Nov 28, 2023 · Updated 2 years ago
- A pastebin to find and share useful resources 📚 ☆46 · May 17, 2023 · Updated 2 years ago
- ☆12 · Jul 6, 2021 · Updated 4 years ago
- ☆10 · Nov 12, 2022 · Updated 3 years ago
- A list of publicly available resources regarding the SAS7BDAT file format ☆11 · Jan 10, 2022 · Updated 4 years ago
- Build wordlists from the common-crawl index ☆12 · Oct 9, 2022 · Updated 3 years ago
- Generic API clients based on Pydantic and protocols ☆13 · Feb 24, 2026 · Updated last week
- Generate custom Mac OS folder icons with a desired image as stamp ☆12 · Oct 3, 2023 · Updated 2 years ago