OlivierBinette / Awesome-Entity-Resolution
List of entity resolution software and resources.
☆63Updated 2 months ago
Alternatives and similar repositories for Awesome-Entity-Resolution:
Users who are interested in Awesome-Entity-Resolution are comparing it to the libraries listed below
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆76Updated this week
- quadipy is a Python package to help transform structured data into RDF graph format☆19Updated 2 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆57Updated last week
- Ibis analytics, with Ibis (and more!)☆21Updated 7 months ago
- PyPI module for Graphlet AI Knowledge Graph Factory☆29Updated 2 years ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆27Updated last year
- ☆38Updated 2 months ago
- dagster scikit-learn pipeline example.☆44Updated 2 years ago
- DuckDB Community Extension to prompt LLMs from SQL☆45Updated 3 months ago
- Python package for deduplication/entity resolution using active learning☆78Updated 8 months ago
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆44Updated 9 months ago
- Implementation of the Cypher language for searching NetworkX graphs☆103Updated last week
- Loading OpenSanctions into Neo4J and Linkurious☆28Updated 4 months ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Updated last year
- Record matching and entity resolution at scale in Spark☆34Updated last year
- [Project moved] Polars integration for Dagster☆36Updated last week
- Demo Project for Open Source MDS☆168Updated this week
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆1Updated 3 weeks ago
- Playing with Python Bluesky SDK☆14Updated 5 months ago
- ☆15Updated 2 years ago
- A maximum-strength name parser for record linkage.☆36Updated 3 weeks ago
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆32Updated 5 months ago
- The Modern Data Stack in a Python package☆49Updated last year
- Playground for integrating large language models into the Modern Data Stack for entity matching☆107Updated 2 years ago
- pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other do…☆10Updated last year
- CLI to create an ER Diagram from DuckDB database files☆119Updated last month
- The SQL/Ibis powered sklearn of record linkage☆15Updated this week
- Tools for interactive visual exploration of semantic embeddings.☆32Updated 7 months ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆76Updated 2 months ago