Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖
☆345Feb 10, 2026Updated last month
Alternatives and similar repositories for awesome-data-centric-ai
Users that are interested in awesome-data-centric-ai are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Resources for Data Centric AI☆1,136Dec 13, 2023Updated 2 years ago
- Synthetic data generators for tabular and time-series data☆1,617Mar 2, 2026Updated 3 weeks ago
- Curated list of open source tooling for data-centric AI on unstructured data.☆734Nov 15, 2023Updated 2 years ago
- Data Quality assessment with one line of code☆454Mar 2, 2026Updated 3 weeks ago
- A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support…☆91Jun 4, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- A curated, but incomplete, list of data-centric AI resources.☆1,141Jun 26, 2024Updated last year
- ☆30Feb 9, 2023Updated 3 years ago
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,438Mar 3, 2026Updated 3 weeks ago
- Standardised Metrics and Methods for Synthetic Tabular Data Evaluation☆36Aug 14, 2024Updated last year
- Fabric SDK to interact with the Fabric platform☆22Mar 4, 2026Updated 3 weeks ago
- Dvc + Streamlit = ❤️☆40Oct 27, 2023Updated 2 years ago
- Open Source Data Annotation & Labeling Tools☆688Mar 14, 2026Updated last week
- NIST Collaborative Research Cycle on Synthetic Data. Learn about Synthetic Data week by week!☆27Jul 13, 2023Updated 2 years ago
- A copier template repository for a e2e batch ZenML MLOps pipeline.☆11Dec 17, 2025Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data …☆11,376Jan 13, 2026Updated 2 months ago
- ☆15Jul 16, 2014Updated 11 years ago
- 🎲 A curated list of MLOps projects, tools and resources☆187Apr 22, 2024Updated last year
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…☆3,999Dec 28, 2025Updated 2 months ago
- Modern development with Python in 2024☆12Mar 16, 2026Updated last week
- a catch-all repo☆11Dec 28, 2023Updated 2 years ago
- nannyml: post-deployment data science in python☆2,133Jul 12, 2025Updated 8 months ago
- ☆13May 12, 2023Updated 2 years ago
- A starter vault in Obsidian for both work and personal knowledge management, complete with seamless workflows.☆15Nov 11, 2025Updated 4 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ML REPA Library: MLOps and ML Engineering Solutions for Success☆23Jun 26, 2023Updated 2 years ago
- ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.☆5,289Updated this week
- ☆12Sep 21, 2023Updated 2 years ago
- Hyperparameter tuning via uncertainty modeling☆49May 3, 2024Updated last year
- Lab assignments for Introduction to Data-Centric AI, MIT IAP 2024 👩🏽💻☆479Feb 24, 2025Updated last year
- Pytest plugin for mocking BigQuery data from the python BigQuery client.☆14Feb 6, 2023Updated 3 years ago
- cleanpy is a CLI tool to remove caches and temporary files related to Python.☆19Oct 27, 2025Updated 5 months ago
- HiPlot fetcher for experiments logged with MLflow☆14May 11, 2022Updated 3 years ago
- Covid-19 spread simulator with human mobility and intervention modeling.☆19May 28, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.☆362Dec 9, 2025Updated 3 months ago
- A curated list of awesome MLOps tools☆5,064Mar 20, 2026Updated last week
- A curated list of references for MLOps☆13,821Nov 21, 2024Updated last year
- Türkiye Teknoloji Takımı Vakfı - Yapay Zeka Usta Eğitimleri Serisi - Makine Öğreniminde Regresyon ve Sınıflandırma☆17Sep 14, 2020Updated 5 years ago
- This repository aims to map the ecosystem of artificial intelligence guidelines, principles, codes of ethics, standards, regulation and b…☆1,415Mar 4, 2026Updated 3 weeks ago
- Awesome list for data journalists and future data journalists☆210Feb 26, 2026Updated last month
- ☆10Sep 11, 2020Updated 5 years ago