Data-Centric-AI-Community / awesome-data-centric-ai
Open-Source Software, Tutorials, and Research on Data-Centric AI π€
β332Updated last year
Alternatives and similar repositories for awesome-data-centric-ai:
Users that are interested in awesome-data-centric-ai are comparing it to the libraries listed below
- Frouros: an open-source Python library for drift detection in machine learning systems.β214Updated 2 months ago
- Curated list of open source tooling for data-centric AI on unstructured data.β716Updated last year
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshadβ¦β639Updated last month
- Data Quality assessment with one line of codeβ437Updated last week
- β Eurybia monitors model drift over time and securizes model deployment with data validationβ207Updated 5 months ago
- This repository provides a curated list of references about Machine Learning Model Governance, Ethics, and Responsible AI.β114Updated 11 months ago
- OmniXAI: A Library for eXplainable AIβ915Updated 8 months ago
- π Minimal examples of machine learning tests for implementation, behaviour, and performance.β262Updated 2 years ago
- Resources for Data Centric AIβ1,109Updated last year
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.β629Updated 3 weeks ago
- Data search & enrichment library for Machine Learning β Easily find and add relevant features to your ML & AI pipeline from hundreds of pβ¦β330Updated this week
- Lab assignments for Introduction to Data-Centric AI, MIT IAP 2024 π©π½βπ»β450Updated last month
- Editing machine learning models to reflect human knowledge and valuesβ124Updated last year
- Streamline scikit-learn model comparison.β145Updated 2 years ago
- Generate Diverse Counterfactual Explanations for any machine learning model.β1,399Updated 4 months ago
- β475Updated 7 months ago
- Example project with a complete MLOps cycle: versioning data, generating reports on pull requests and deploying the model on releases witβ¦β48Updated 3 years ago
- Learn how to create reliable ML systems by testing code, data and models.β86Updated 2 years ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profilβ¦β75Updated 11 months ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.β129Updated last year
- Introduction to Data-Centric AI, MIT IAP 2023 π€β98Updated 2 months ago
- Responsible AI knowledge baseβ101Updated 2 years ago
- Tutorials for YData's Fabric platformβ31Updated 3 weeks ago
- Monitor the stability of a Pandas or Spark dataframe βοΈβ500Updated 2 months ago
- MLOps Cookiecutter Template: A Base Project Structure for Secure Production ML Engineeringβ40Updated 5 months ago
- A series of Terraform based recipes to provision popular MLOps stacks on the cloud.β255Updated 6 months ago
- A book of subtle code tricks and gem resources for all things data, machine learning and deep learning.β165Updated 7 months ago
- Explainable AI framework for data scientists. Explain & debug any blackbox machine learning model with a single line of code. We are lookβ¦β430Updated 7 months ago
- π A curated list of resources dedicated to synthetic dataβ127Updated 2 years ago
- A python package for simultaneous Hyperparameters Tuning and Features Selection for Gradient Boosting Models.β577Updated 10 months ago