stefan-grafberger / mlwhatif
Data-Centric What-If Analysis for Native Machine Learning Pipelines
☆15 Updated last year
Related projects:
- Inspect ML Pipelines in Python in the form of a DAG ☆68 Updated 6 months ago
- Explaining Inference Queries with Bayesian Optimization ☆10 Updated 3 years ago
- Jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptio… ☆35 Updated last year
- Paper list about adopting machine learning techniques into data management tasks. ☆37 Updated 4 years ago
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-… ☆17 Updated last year
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io). ☆40 Updated 2 years ago
- A Declarative System for Optimizing AI Workloads ☆44 Updated last week
- Code and Benchmarks for JOSIE (SIGMOD 2019) ☆18 Updated last year
- Foundation Models for Data Tasks ☆99 Updated last year
- Dias: Dynamic Rewriting of Pandas Code ☆54 Updated 3 months ago
- Large scale graph learning on a single machine. ☆160 Updated last week
- SkinnerDB is an analytical database management system. It uses adaptive processing and reinforcement learning to find near-optimal join o… ☆47 Updated 6 months ago
- Characterization of relational table embeddings (VLDB 2024). ☆22 Updated 2 months ago
- ☆19 Updated 2 years ago
- ☆47 Updated 8 months ago
- FDX, SIGMOD 2020 ☆18 Updated 4 months ago
- Data System for Optimized Deep Learning Model Selection ☆20 Updated last year
- A Benchmark for Joint Data Cleaning and Machine Learning ☆44 Updated 3 months ago
- State-of-the-art neural cardinality estimators for join queries ☆66 Updated 3 years ago
- ☆30 Updated 2 years ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond… ☆21 Updated 2 years ago
- A Python-to-SQL transpiler as replacement for Python Pandas ☆47 Updated last year
- Implementation of DeepDB: Learn from Data, not from Queries! ☆90 Updated last year
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload ☆35 Updated this week
- PyTorch implementation of binary tree convolution ☆45 Updated 4 years ago
- GraphMineSuite (GMS): a benchmarking suite for graph mining algorithms such as graph pattern matching or graph learning ☆25 Updated 3 years ago
- The BART Project: Benchmarking Algorithms for (data) Repairing and Translation ☆35 Updated 9 months ago
- DB-BERT tunes database systems for optimal performance, using tuning hints mined from text. ☆57 Updated last year
- ☆13 Updated 3 years ago
- Code and workloads from the Learned Cardinalities paper (https://arxiv.org/abs/1809.00677) ☆113 Updated 5 years ago