stefan-grafberger / mlwhatifLinks
Data-Centric What-If Analysis for Native Machine Learning Pipelines
☆16Updated 2 years ago
Alternatives and similar repositories for mlwhatif
Users that are interested in mlwhatif are comparing it to the libraries listed below
Sorting:
- A System for Optimized Semantic Computation☆127Updated this week
- ☆60Updated 2 months ago
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated last year
- Paper list about adopting machine learning techniques into data management tasks.☆37Updated 5 years ago
- ☆19Updated 3 years ago
- A Benchmark for Joint Data Cleaning and Machine Learning☆49Updated last year
- Large scale graph learning on a single machine.☆164Updated 5 months ago
- FDX, SIGMOD 2020☆19Updated last year
- ☆24Updated 2 months ago
- SkinnerDB is an analytical database management system. It uses adaptive processing and reinforcement learning to find near-optimal join o…☆49Updated last year
- ☆9Updated last year
- Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptio…☆41Updated 2 years ago
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload☆43Updated 4 months ago
- Source code for several Metanome data profiling algorithms☆56Updated 2 years ago
- A prototype implementation of Bao for PostgreSQL☆204Updated 10 months ago
- Implementation of DeepDB: Learn from Data, not from Queries!☆102Updated 2 years ago
- Dumpy: A Compact and Adaptive Index for Large Data Series Collections (SIGMOD'23)☆12Updated last year
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆19Updated 2 years ago
- The DSB benchmark is designed for evaluating both workloaddriven and traditional database systems on modern decision support workloads. D…☆62Updated 9 months ago
- Labelled Subgraph Query Benchmark – A lightweight benchmark suite focusing on subgraph matching queries. Note: This is a microbenchmark f…☆34Updated 2 months ago
- Rich and fast user-defined functions in relational databases☆14Updated 2 years ago
- DuckDB is an in-process SQL OLAP Database Management System☆44Updated 3 weeks ago
- Characterization of relational table embeddings (VLDB 2024).☆30Updated last year
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-…☆21Updated 2 years ago
- Factorized Incremental View Maintenance for Queries and Analytics☆21Updated 2 weeks ago
- Pollock is a benchmark for data loading on character-delimited files.☆20Updated 3 months ago
- The BART Project: Benchmarking Algorithms for (data) Repairing and Translation☆42Updated last year
- ☆11Updated 2 years ago
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆90Updated 2 months ago
- ⚡ Faster similarity search with PDX: A vertical data layout for vectors☆50Updated 2 weeks ago