scify / JedAI-SparkLinks
☆15Updated 2 years ago
Alternatives and similar repositories for JedAI-Spark
Users that are interested in JedAI-Spark are comparing it to the libraries listed below
Sorting:
- UI for JedAI Toolkit☆17Updated 3 years ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b…☆23Updated 2 years ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆78Updated last month
- Record matching and entity resolution at scale in Spark☆34Updated last year
- ☆32Updated 3 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆65Updated last year
- An End-to-End Evaluation Framework for Entity Resolution Systems☆29Updated last year
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆31Updated 2 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 4 years ago
- ☆11Updated last year
- Record Linkage ToolKit (Find and link entities)☆110Updated last year
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆47Updated 7 years ago
- An open source, high scalability toolkit in Java for Entity Resolution.☆218Updated last year
- Resources for tackling record linkage / deduplication / data matching problems☆124Updated last year
- Python implementations of record linkage blocking techniques.☆21Updated last year
- Minoan ER is an Entity Resolution (ER) framework, built by researchers in Crete (the land of the ancient Minoan civilization). Entity res…☆17Updated 4 years ago
- List of entity resolution software and resources.☆75Updated 4 months ago
- Fork of the Freely Extensible Biomedical Record Linkage program☆24Updated 8 years ago
- T2K Match is a matching algorithm optimised to match millions of web tables to a central knowledge base.☆21Updated 7 years ago
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema.☆29Updated 6 months ago
- ☆11Updated 6 years ago
- ☆190Updated last year
- importing Thomson Reuters' permID dataset into Neo4j☆19Updated 7 years ago
- Notebooks for the ML Link Prediction Course☆14Updated 4 years ago
- pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other do…☆10Updated last year
- Queries for modeling, importing, and analyzing multi-dimensional event data using the Labeled Property Graph data model of Graph Database…☆18Updated 2 years ago
- Optimal distributed data deduplication and supervised learning pipeline using Apache Spark☆10Updated 4 years ago
- Repository of Notebooks taken from https://neo4j.com/graph-algorithms-book/☆26Updated 5 years ago
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- Binding the GDELT universe in a Spark environment☆25Updated 2 years ago