Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptions (e.g., missing values, broken character encodings) on the prediction quality of their ML models.
☆43Jun 21, 2023Updated 3 years ago
Alternatives and similar repositories for jenga
Users that are interested in jenga are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Jun 14, 2023Updated 3 years ago
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io).☆50Apr 7, 2026Updated 2 months ago
- Imputation of missing values in tables.☆492Jan 14, 2026Updated 5 months ago
- TREC-COVID results - this is a mirror of data on the TREC website in a more convenient format.☆15Aug 31, 2020Updated 5 years ago
- ☆28Feb 2, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Automated data quality suggestions and analysis with Deequ on AWS Glue☆93Dec 29, 2022Updated 3 years ago
- The monorepo that powers the GreenDB.☆26Nov 27, 2023Updated 2 years ago
- ☆12Jul 8, 2024Updated last year
- Sharing how to deploy AutoML model into Teradata DB☆21Mar 20, 2025Updated last year
- Machine Learning Tool Box☆28Dec 7, 2023Updated 2 years ago
- ☆11Jul 21, 2022Updated 3 years ago
- Implementation of Google Dremel's storage engine in a custom in-memory DB with query compilation.☆14Oct 10, 2020Updated 5 years ago
- ☆12Jun 10, 2026Updated 2 weeks ago
- Session-based recommender system: Serenade☆81Nov 6, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Lab tasks for the course on "Data Engineering for Machine Learning"☆10May 1, 2023Updated 3 years ago
- ZSH plugin that automagically detects and activates your python environments (poetry, virtualenv, conda) while traversing directories.☆31Jan 27, 2024Updated 2 years ago
- ☆17Aug 8, 2023Updated 2 years ago
- Python library for training a covariate shift estimator☆13Feb 27, 2019Updated 7 years ago
- Data analysis of https://www.kaggle.com/mylesoneill/world-university-rankings☆12Sep 23, 2020Updated 5 years ago
- python framework for writing robot strategies☆17Jun 1, 2018Updated 8 years ago
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆11Sep 4, 2025Updated 9 months ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift☆30Jun 14, 2021Updated 5 years ago
- Skewed Data Generator for TPC-H☆13Apr 7, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Assigns Elo Ratings to Python Global Optimizers☆24Apr 3, 2024Updated 2 years ago
- Benchmarking Semantic Query Processing Engines☆59Jun 8, 2026Updated 3 weeks ago
- ☆13Jul 25, 2024Updated last year
- Dissertation (Jeff Heaton)☆10Oct 10, 2019Updated 6 years ago
- Slides from my talk on spaCy IRL, regarding sparse attention.☆12Jul 9, 2019Updated 6 years ago
- Code for the AISTATS 2024 Paper "From Data Imputation to Data Cleaning - Automated Cleaning of Tabular Data Improves Downstream Predictiv…☆24Feb 14, 2024Updated 2 years ago
- queries for mimic-iv☆11Jul 2, 2021Updated 4 years ago
- BinDex: A Two-Layered Index for Fast and Robust Scans (SIGMOD2020)☆10Jun 5, 2020Updated 6 years ago
- Code for Episodic Memory Reader (EMR) https://arxiv.org/abs/1903.06164☆15Nov 16, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A library for open domain query facet extraction and generation☆16Apr 24, 2024Updated 2 years ago
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-…☆21Jun 29, 2023Updated 3 years ago
- Public knowledge, help, code, contributions and projects about how to work with Softbank Robotics' Pepper robot model☆14Oct 8, 2019Updated 6 years ago
- Open Targets Library ETL Pipeline | Apache Beam☆16May 5, 2021Updated 5 years ago
- INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions☆16Jan 21, 2025Updated last year
- Bid, launch and manage your EC2 Spot instances.☆23Dec 24, 2013Updated 12 years ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆20Sep 22, 2021Updated 4 years ago