stanford-futuredata / omg
☆20Updated last year
Related projects ⓘ
Alternatives and complementary repositories for omg
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆30Updated 2 years ago
- Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptio…☆35Updated last year
- Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to production…☆29Updated 11 months ago
- Foundation Models for Data Tasks☆100Updated last year
- Inspect ML Pipelines in Python in the form of a DAG☆69Updated 8 months ago
- openclean - Data Cleaning and data profiling library for Python☆69Updated 3 years ago
- Unified slicing for all Python data structures.☆36Updated 8 months ago
- Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning☆50Updated last year
- Data Cleaning for ML under the Certain Prediction Framework☆11Updated 2 years ago
- ☆30Updated 2 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast☆16Updated 2 years ago
- Hyperparameter tuning via uncertainty modeling☆46Updated 6 months ago
- AutoBazaar: An AutoML System from the Machine Learning Bazaar☆32Updated 3 years ago
- ☆29Updated 3 years ago
- SPEAR: Programmatically label and build training data quickly.☆103Updated 4 months ago
- this repo might get accepted☆29Updated 3 years ago
- Record matching and entity resolution at scale in Spark☆31Updated last year
- ☆19Updated 6 months ago
- Python package for deduplication/entity resolution using active learning☆78Updated 2 months ago
- example how to perform distributed bayesian optimisation (autoML) using optuna on metaflow☆10Updated 3 years ago
- ☆26Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆56Updated last year
- Ludwig benchmark☆19Updated 2 years ago
- A repository containing the Jupyter notebook code generation benchmark.☆57Updated 2 years ago
- ☆12Updated 4 years ago
- Repository for the ML Technology Readiness Levels framework☆35Updated 4 months ago
- A library of Reversible Data Transforms☆121Updated this week
- Tabular In-Context Learning☆26Updated last month
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)☆13Updated last month