HPI-Information-Systems / metanome-algorithmsLinks
Source code for several Metanome data profiling algorithms
☆56Updated 2 years ago
Alternatives and similar repositories for metanome-algorithms
Users that are interested in metanome-algorithms are comparing it to the libraries listed below
Sorting:
- The BART Project: Benchmarking Algorithms for (data) Repairing and Translation☆42Updated last year
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆90Updated 2 months ago
- ☆60Updated 2 months ago
- FDX, SIGMOD 2020☆19Updated last year
- ☆79Updated 2 years ago
- A Benchmark for Joint Data Cleaning and Machine Learning☆49Updated last year
- DBEst, AQP engine☆18Updated last year
- A Generalized Data Cleaning System☆50Updated 9 years ago
- Rheem - a cross-platform data processing system☆5Updated 3 years ago
- Distributed Temporal Graph Analytics with Apache Flink☆248Updated this week
- A System for Optimized Semantic Computation☆124Updated last week
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-…☆21Updated 2 years ago
- ☆192Updated last year
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Updated 2 years ago
- This repository provides data and scripts to use Sherlock, a DL-based model for semantic data type detection: https://sherlock.media.mit.…☆167Updated last year
- Interactive-Speed Analytics: 200x Faster, 200x Fewer Cluster Resources, Approximate Query Processing☆250Updated 4 years ago
- Applications and APIs from Oracle Graph☆51Updated last month
- An open source, high scalability toolkit in Java for Entity Resolution.☆218Updated 3 weeks ago
- simialrity join or search on spark core directly☆27Updated 5 years ago
- Deep Web Crawler for Data Enrichment☆30Updated 2 years ago
- Reference implementations for LDBC Social Network Benchmark's Interactive workload.☆105Updated 6 months ago
- ☆19Updated 3 years ago
- An experimental Graph Streaming API for Apache Flink☆141Updated 4 years ago
- A SQL Query Similarity Metric Benchmark☆15Updated 7 years ago
- Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptio…☆41Updated 2 years ago
- Benchmark code for comparing different databases☆12Updated last year
- SkinnerDB is an analytical database management system. It uses adaptive processing and reinforcement learning to find near-optimal join o…☆49Updated last year
- ☆9Updated last year
- Synthetic graph generator for the LDBC Social Network Benchmark, running on Spark☆174Updated 3 months ago
- The source repository of the Metanome tool☆184Updated 2 months ago