stefan-grafberger / mlwhatif
Data-Centric What-If Analysis for Native Machine Learning Pipelines
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for mlwhatif
- Inspect ML Pipelines in Python in the form of a DAG☆69Updated 8 months ago
- Explaining Inference Queries with Bayesian Optimization☆10Updated 3 years ago
- Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptio…☆35Updated last year
- ☆19Updated 2 years ago
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io).☆41Updated 2 years ago
- Paper list about adopting machine learning techniques into data management tasks.☆37Updated 4 years ago
- Large scale graph learning on a single machine.☆161Updated 2 months ago
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-…☆18Updated last year
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload☆36Updated 2 months ago
- FDX, SIGMOD 2020☆19Updated 6 months ago
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆18Updated last year
- A Declarative System for Optimizing AI Workloads☆53Updated last week
- SkinnerDB is an analytical database management system. It uses adaptive processing and reinforcement learning to find near-optimal join o…☆47Updated 8 months ago
- DuckDB is an in-process SQL OLAP Database Management System☆39Updated this week
- A Python-to-SQL transpiler as replacement for Python Pandas☆47Updated last year
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆84Updated 3 weeks ago
- A Benchmark for Joint Data Cleaning and Machine Learning☆44Updated 5 months ago
- Labelled Subgraph Query Benchmark – A lightweight benchmark suite focusing on subgraph matching queries. Note: This is a microbenchmark f…☆27Updated last month
- ☆49Updated 2 months ago
- Foundation Models for Data Tasks☆100Updated last year
- Dias: Dynamic Rewriting of Pandas Code☆54Updated this week
- ☆25Updated 6 years ago
- Implementation of DeepDB: Learn from Data, not from Queries!☆92Updated last year
- Data System for Optimized Deep Learning Model Selection☆20Updated 2 years ago
- A Jupyter notebook extension to centralize and manage data☆14Updated last year
- A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such a…☆119Updated this week
- ☆126Updated 2 weeks ago
- ☆20Updated this week
- GraphMineSuite (GMS): a benchmarking suite for graph mining algorithms such as graph pattern matching or graph learning☆25Updated 3 years ago
- ☆42Updated last year