Data-Centric-AI-Community / awesome-data-centric-ai
Open-Source Software, Tutorials, and Research on Data-Centric AI
★331 · Updated last year
Alternatives and similar repositories for awesome-data-centric-ai:
Users who are interested in awesome-data-centric-ai compare it to the libraries listed below.
- Curated list of open source tooling for data-centric AI on unstructured data. ★709 · Updated last year
- Frouros: an open-source Python library for drift detection in machine learning systems. ★208 · Updated 3 weeks ago
- Data Quality assessment with one line of code ★434 · Updated 3 weeks ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profil… ★72 · Updated 9 months ago
- Streamline scikit-learn model comparison. ★146 · Updated 2 years ago
- Data search & enrichment library for Machine Learning: easily find and add relevant features to your ML & AI pipeline from hundreds of p… ★326 · Updated this week
- Eurybia monitors model drift over time and secures model deployment with data validation ★206 · Updated 3 months ago
- Benchmarking synthetic data generation methods. ★267 · Updated this week
- A series of Terraform based recipes to provision popular MLOps stacks on the cloud. ★253 · Updated 4 months ago
- A curated list of MLOps projects, tools and resources ★186 · Updated 9 months ago
- The Fuzzy Labs guide to the universe of open source MLOps ★455 · Updated 7 months ago
- Metrics to evaluate quality and efficacy of synthetic datasets. ★222 · Updated this week
- Synthetic data generators for structured and unstructured text, featuring differentially private learning. ★615 · Updated last week
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer. ★129 · Updated last year
- A library for debugging/inspecting machine learning classifiers and explaining their predictions ★274 · Updated last month
- Minimal examples of machine learning tests for implementation, behaviour, and performance. ★260 · Updated 2 years ago
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s… ★691 · Updated last month
- WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022) ★160 · Updated 2 years ago
- Learn how to create reliable ML systems by testing code, data and models. ★86 · Updated 2 years ago
- Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow... ★390 · Updated 2 years ago
- A library of Reversible Data Transforms ★123 · Updated this week
- Weakly Supervised End-to-End Learning (NeurIPS 2021) ★157 · Updated last year
- Resources for Data Centric AI ★1,108 · Updated last year
- Explore and compare 1K+ accurate decision trees in your browser! ★159 · Updated 11 months ago
- Experiments on Tabular Data Models ★272 · Updated last year
- Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Created by Ra… ★745 · Updated 6 months ago
- Tutorials on creating a reproducible and maintainable data science project ★142 · Updated 2 years ago
- A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day ★718 · Updated last year
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging) ★207 · Updated this week
- Monitor the stability of a Pandas or Spark dataframe ★498 · Updated 3 weeks ago