π¦ Data Versioning and ML Experiments
β15,385Feb 16, 2026Updated last week
Alternatives and similar repositories for dvc
Users that are interested in dvc are comparing it to the libraries listed below
Sorting:
- βΎοΈ CML - Continuous Machine Learning | CI/CD for MLβ4,168Jun 2, 2025Updated 8 months ago
- The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, β¦β24,365Updated this week
- Build, Manage and Deploy AI/ML Systemsβ9,863Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.β41,516Updated this week
- Always know what to expect from your data.β11,162Feb 20, 2026Updated last week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering andβ¦β10,768Updated this week
- Streamlit β A faster way to build and share data apps.β43,634Updated this week
- Parallel computing with task schedulingβ13,746Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β21,652Feb 21, 2026Updated last week
- The Open Source Feature Store for AI/MLβ6,737Updated this week
- Modin: Scale your Pandas workflows by changing a single line of codeβ10,362Feb 10, 2026Updated 2 weeks ago
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!β8,472Updated this week
- Data-Centric Pipelines and Data Versioningβ6,286Feb 3, 2025Updated last year
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.β30,860Feb 21, 2026Updated last week
- Machine Learning Toolkit for Kubernetesβ15,462Jan 5, 2026Updated last month
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and moreβ34,940Updated this week
- Hydra is a framework for elegantly configuring complex applicationsβ10,219Feb 7, 2026Updated 3 weeks ago
- A game theoretic approach to explain the output of any machine learning model.β25,072Feb 20, 2026Updated last week
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learningβ20,174Feb 9, 2026Updated 2 weeks ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,475Feb 5, 2026Updated 3 weeks ago
- π Parameterize, execute, and analyze notebooksβ6,388Jan 5, 2026Updated last month
- An orchestration platform for the development, production, and observation of data assets.β15,007Updated this week
- ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling β¦β6,532Updated this week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning modelsβ4,730Feb 16, 2026Updated last week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycleβ3,697Feb 21, 2026Updated last week
- Extremely fast Query Engine for DataFrames, written in Rustβ37,513Updated this week
- Python packaging and dependency management made easyβ34,279Updated this week
- A system for quickly generating training data with weak supervisionβ5,940May 2, 2024Updated last year
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.β14,675Dec 1, 2025Updated 2 months ago
- Low-code framework for building custom LLMs, neural networks, and other AI modelsβ11,651Updated this week
- π« Industrial-strength Natural Language Processing (NLP) in Pythonβ33,254Nov 27, 2025Updated 3 months ago
- Production infrastructure for machine learning at scaleβ8,031Jun 12, 2024Updated last year
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflowsβ44,430Updated this week
- Label Studio is a multi-type data labeling and annotation tool with standardized output formatβ26,505Updated this week
- An open source python library for automated feature engineeringβ7,614Feb 3, 2026Updated 3 weeks ago
- A hyperparameter optimization frameworkβ13,583Updated this week
- Evidently is ββan open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Froβ¦β7,227Updated this week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.β6,754Feb 21, 2026Updated last week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.β13,389Feb 2, 2026Updated 3 weeks ago