HPI-Information-Systems / metanome-algorithms
Source code for several Metanome data profiling algorithms
☆57Updated 2 years ago
Alternatives and similar repositories for metanome-algorithms
Users who are interested in metanome-algorithms are comparing it to the libraries listed below.
- The BART Project: Benchmarking Algorithms for (data) Repairing and Translation☆42Updated last year
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆90Updated 3 months ago
- A Benchmark for Joint Data Cleaning and Machine Learning☆49Updated last year
- ☆79Updated 2 years ago
- ☆60Updated 2 months ago
- FDX, SIGMOD 2020☆19Updated last year
- ☆192Updated last year
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Updated 2 years ago
- A Generalized Data Cleaning System☆51Updated 9 years ago
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆19Updated 2 years ago
- Applications and APIs from Oracle Graph☆51Updated 2 weeks ago
- Reference implementations for LDBC Social Network Benchmark's Interactive workload.☆107Updated 7 months ago
- ☆19Updated 3 years ago
- A System for Optimized Semantic Computation☆136Updated this week
- Characterization of relational table embeddings (VLDB 2024).☆31Updated last year
- Synthetic graph generator for the LDBC Social Network Benchmark, running on Spark☆175Updated 4 months ago
- An open source, high scalability toolkit in Java for Entity Resolution.☆221Updated last month
- A Jupyter notebook extension to centralize and manage data☆15Updated 2 years ago
- An experimental Graph Streaming API for Apache Flink☆141Updated 4 years ago
- Python package for performing Entity and Text Matching using Deep Learning.☆599Updated last year
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-…☆21Updated 2 years ago
- DBEst, AQP engine☆18Updated last year
- Distributed Temporal Graph Analytics with Apache Flink☆249Updated last week
- A Machine Learning System for Data Enrichment.☆524Updated 2 years ago
- Similarity join or search on Spark core directly☆27Updated 5 years ago
- A polystore database from researchers of the Intel Science and Technology Center for Big Data☆38Updated 2 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆65Updated last year
- Jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptio…☆41Updated 2 years ago
- WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing,…☆110Updated 3 years ago
- Interactive-Speed Analytics: 200x Faster, 200x Fewer Cluster Resources, Approximate Query Processing☆250Updated 4 years ago