Data-Centric-AI-Community / awesome-data-centric-ai
Open-Source Software, Tutorials, and Research on Data-Centric AI π€
β319Updated 10 months ago
Related projects β
Alternatives and complementary repositories for awesome-data-centric-ai
- Data Quality assessment with one line of codeβ426Updated this week
- Frouros: an open-source Python library for drift detection in machine learning systems.β192Updated 3 weeks ago
- OmniXAI: A Library for eXplainable AIβ874Updated 3 months ago
- Curated list of open source tooling for data-centric AI on unstructured data.β699Updated 11 months ago
- Streamline scikit-learn model comparison.β146Updated last year
- β Eurybia monitors model drift over time and securizes model deployment with data validationβ205Updated 2 weeks ago
- Editing machine learning models to reflect human knowledge and valuesβ123Updated last year
- Data search & enrichment library for Machine Learning β Easily find and add relevant features to your ML & AI pipeline from hundreds of pβ¦β318Updated this week
- Explainable AI framework for data scientists. Explain & debug any blackbox machine learning model with a single line of code. We are lookβ¦β417Updated 2 months ago
- Metrics to evaluate quality and efficacy of synthetic datasets.β211Updated this week
- Fast SHAP value computation for interpreting tree-based modelsβ521Updated last year
- Tutorials for YData's Fabric platformβ32Updated this week
- π Minimal examples of machine learning tests for implementation, behaviour, and performance.β254Updated 2 years ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.β126Updated 10 months ago
- Experiments on Tabular Data Modelsβ268Updated last year
- The Fuzzy Labs guide to the universe of open source MLOpsβ448Updated 3 months ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profilβ¦β64Updated 6 months ago
- Resources for Data Centric AIβ1,100Updated 10 months ago
- Interpret Community extends Interpret repository with additional interpretability techniques and utility functions to handle real-world dβ¦β421Updated 5 months ago
- Benchmarking synthetic data generation methods.β262Updated this week
- A series of Terraform based recipes to provision popular MLOps stacks on the cloud.β247Updated last month
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.β590Updated last week
- WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)β160Updated 2 years ago
- A python package for simultaneous Hyperparameters Tuning and Features Selection for Gradient Boosting Models.β566Updated 5 months ago
- Lab assignments for Introduction to Data-Centric AI, MIT IAP 2024 π©π½βπ»β431Updated 10 months ago
- A novel approach for synthesizing tabular data using pretrained large language modelsβ281Updated last week
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.β212Updated 3 weeks ago
- Build Low Code Automated Tensorflow explainable models in just 3 lines of code. Library created by: Hasan Rafiq - https://www.linkedin.coβ¦β181Updated last year
- Drift Detection for your PyTorch Modelsβ312Updated 2 years ago
- This repository provides a curated list of references about Machine Learning Model Governance, Ethics, and Responsible AI.β99Updated 6 months ago