stefan-grafberger / mlwhatif
Data-Centric What-If Analysis for Native Machine Learning Pipelines
☆16Updated 2 years ago
Alternatives and similar repositories for mlwhatif
Users who are interested in mlwhatif are comparing it to the libraries listed below.
- ☆60Updated 5 months ago
- A System for Optimized Semantic Computation☆164Updated last week
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated last year
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆18Updated 2 years ago
- A Benchmark for Joint Data Cleaning and Machine Learning☆49Updated last year
- Paper list about adopting machine learning techniques into data management tasks.☆37Updated 5 years ago
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆97Updated last month
- Large scale graph learning on a single machine.☆165Updated 8 months ago
- Jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptio…☆41Updated 2 years ago
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-…☆21Updated 2 years ago
- Source code for several Metanome data profiling algorithms☆59Updated 2 years ago
- FDX, SIGMOD 2020☆19Updated last year
- ☆20Updated 3 years ago
- Characterization of relational table embeddings (VLDB 2024).☆32Updated last year
- ☆103Updated 3 years ago
- SkinnerDB is an analytical database management system. It uses adaptive processing and reinforcement learning to find near-optimal join o…☆51Updated last year
- ☆24Updated 5 months ago
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload☆44Updated 7 months ago
- A collection of resources on dynamic/streaming/temporal/evolving graph processing systems, databases, data structures, datasets, and rela…☆143Updated 2 years ago
- DuckDB is an in-process SQL OLAP Database Management System☆47Updated 3 weeks ago
- Dias: Dynamic Rewriting of Pandas Code☆79Updated 4 months ago
- ☆31Updated 3 years ago
- Apache datasketches☆37Updated 3 months ago
- State-of-the-art search over vector embeddings and structured data (SIGMOD '24)☆90Updated 8 months ago
- Labelled Subgraph Query Benchmark – A lightweight benchmark suite focusing on subgraph matching queries. Note: This is a microbenchmark f…☆35Updated 6 months ago
- Dumpy: A Compact and Adaptive Index for Large Data Series Collections (SIGMOD'23)☆13Updated last year
- ☆27Updated 3 years ago
- Benchmarking Semantic Query Processing Engines☆34Updated 2 weeks ago
- ⚡ Faster similarity search with PDX: A vertical data layout for vectors☆60Updated 2 months ago
- A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such a…☆172Updated last month