Valires / Awesome-Entity-Resolution
List of entity resolution software and resources.
☆38Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for Awesome-Entity-Resolution
- An End-to-End Evaluation Framework for Entity Resolution Systems☆26Updated 11 months ago
- A browser user interface for manual labeling of record pairs.☆41Updated last year
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆73Updated last week
- The SQL/Ibis powered sklearn of record linkage☆14Updated this week
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 3 years ago
- Tools for interactive visual exploration of semantic embeddings.☆29Updated 2 months ago
- Interactive notebooks containing demonstration code of the splink library☆37Updated 10 months ago
- PyPi module for Graphlet AI Knowledge Graph Factory☆28Updated last year
- quadipy is a python package to help transform structured data into RDF graph format☆18Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆17Updated 2 years ago
- Benchmark study on KùzuDB, an embedded OLAP graph database, on an artificial social network dataset☆28Updated 3 months ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Updated 11 months ago
- A maximum-strength name parser for record linkage.☆34Updated 3 months ago
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆105Updated 5 months ago
- Resources for tackling record linkage / deduplication / data matching problems☆112Updated 8 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆44Updated 4 months ago
- spaCy extension for Visual Studio Code☆25Updated last year
- Ibis analytics, with Ibis (and more!)☆19Updated last month
- Graph Engine for Exploration and Search☆40Updated 9 months ago
- Python package for deduplication/entity resolution using active learning☆78Updated 2 months ago
- spaCy entry points for Curated Transformers☆25Updated last month
- Playing with Python Bluesky SDK☆13Updated this week
- A Flexible Deep Learning Approach to Fuzzy String Matching☆139Updated last month
- new skills taxonomy using TextKernel data☆30Updated 2 years ago
- Python implementation of anonymous linkage using cryptographic linkage keys☆63Updated 6 months ago
- Implementation of the Cypher language for searching NetworkX graphs☆83Updated this week
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆60Updated this week
- Dataframe Integration with spaCy.☆101Updated 3 years ago
- DuckDB Community Extension to prompt LLMs from SQL☆22Updated last week