Data-Centric-AI-Community / awesome-data-centric-ai
Open-Source Software, Tutorials, and Research on Data-Centric AI π€
β331Updated last year
Alternatives and similar repositories for awesome-data-centric-ai:
Users that are interested in awesome-data-centric-ai are comparing it to the libraries listed below
- Data Quality assessment with one line of codeβ435Updated this week
- Curated list of open source tooling for data-centric AI on unstructured data.β713Updated last year
- Frouros: an open-source Python library for drift detection in machine learning systems.β210Updated last month
- This repository provides a curated list of references about Machine Learning Model Governance, Ethics, and Responsible AI.β114Updated 11 months ago
- Streamline scikit-learn model comparison.β146Updated 2 years ago
- Resources for Data Centric AIβ1,107Updated last year
- Tutorials for YData's Fabric platformβ31Updated this week
- π² A curated list of MLOps projects, tools and resourcesβ186Updated 11 months ago
- OmniXAI: A Library for eXplainable AIβ906Updated 8 months ago
- A novel approach for synthesizing tabular data using pretrained large language modelsβ303Updated 4 months ago
- Data search & enrichment library for Machine Learning β Easily find and add relevant features to your ML & AI pipeline from hundreds of pβ¦β327Updated this week
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profilβ¦β74Updated 10 months ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.β129Updated last year
- Metrics to evaluate quality and efficacy of synthetic datasets.β227Updated this week
- Benchmarking synthetic data generation methods.β270Updated this week
- Editing machine learning models to reflect human knowledge and valuesβ124Updated last year
- The Fuzzy Labs guide to the universe of open source MLOpsβ459Updated 8 months ago
- CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithmsβ286Updated last year
- Coarse-grained lineage and tracing for machine learning pipelines.β467Updated 2 years ago
- nannyml: post-deployment data science in pythonβ2,039Updated 2 months ago
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to sβ¦β696Updated last week
- Explore and compare 1K+ accurate decision trees in your browser!β159Updated last year
- WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)β160Updated 2 years ago
- A curated list of awesome academic research, books, code of ethics, data sets, institutes, maturity models, newsletters, principles, podcβ¦β68Updated this week
- β Eurybia monitors model drift over time and securizes model deployment with data validationβ206Updated 4 months ago
- Drift Detection for your PyTorch Modelsβ315Updated 2 years ago
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshadβ¦β630Updated last month
- Algorithms for outlier, adversarial and drift detectionβ2,322Updated this week
- π§ͺ Simple data science experimentation & tracking with jupyter, papermill, and mlflow.β180Updated 8 months ago
- Uncertainty Quantification 360 (UQ360) is an extensible open-source toolkit that can help you estimate, communicate and use uncertainty iβ¦β259Updated 7 months ago