scify / JedAI-Spark
☆15Updated 3 years ago
Alternatives and similar repositories for JedAI-Spark
Users who are interested in JedAI-Spark compare it to the libraries listed below.
- An open source, high scalability toolkit in Java for Entity Resolution.☆221Updated 4 months ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆85Updated 2 weeks ago
- A systematic benchmarking of the performance of Spark-SQL for processing vast RDF datasets☆14Updated 3 years ago
- ☆193Updated last year
- importing Thomson Reuters' permID dataset into Neo4j☆19Updated 7 years ago
- Record Linkage ToolKit (Find and link entities)☆110Updated 2 years ago
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆32Updated 3 years ago
- Record matching and entity resolution at scale in Spark☆35Updated 2 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆65Updated last year
- ☆32Updated 4 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆125Updated last year
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆97Updated last month
- Entity resolution for Elasticsearch.☆163Updated last month
- Minoan ER is an Entity Resolution (ER) framework, built by researchers in Crete (the land of the ancient Minoan civilization). Entity res…☆17Updated 5 years ago
- Applications and APIs from Oracle Graph☆52Updated 3 weeks ago
- ☆79Updated 2 years ago
- TableAnnotation is a semantic annotation tool for tables leveraging three steps: table preprocessing, entity lookup and annotation (Cell-…☆16Updated last year
- An End-to-End Evaluation Framework for Entity Resolution Systems☆32Updated last year
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b…☆24Updated 2 years ago
- Comprises the whole SANSA stack☆15Updated 5 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆142Updated last year
- Examples for working with and extending Stardog☆80Updated 3 weeks ago
- Big Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/☆148Updated 2 months ago
- ☆11Updated 2 years ago
- Graph databases, Knowledge Graphs, SPARQ☆80Updated 4 years ago
- WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing,…☆111Updated 3 years ago
- A list of free data matching and record linkage software.☆394Updated last year
- Search relevance evaluation toolkit☆34Updated 3 years ago
- The complete graph data science platform☆140Updated 9 months ago
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆297Updated last year