Data-Centric Pipelines and Data Versioning
★ 6,297 · Feb 3, 2025 · Updated last year
Alternatives and similar repositories for pachyderm
Users that are interested in pachyderm are comparing it to the libraries listed below.
- Machine Learning Toolkit for Kubernetes (★ 15,628 · Updated this week)
- 🦉 Data Versioning and ML Experiments (★ 15,586 · Apr 28, 2026 · Updated last week)
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle (★ 3,706 · Apr 26, 2026 · Updated 2 weeks ago)
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models (★ 4,747 · Mar 23, 2026 · Updated last month)
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… (★ 18,716 · Updated this week)
- Workflow Engine for Kubernetes (★ 16,668 · Updated this week)
- Build, Manage and Deploy AI/ML Systems (★ 10,078 · May 5, 2026 · Updated last week)
- High-Performance Serverless event and data processing platform (★ 5,710 · Apr 28, 2026 · Updated 2 weeks ago)
- Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows. (★ 7,019 · Updated this week)
- The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, a… (★ 25,831 · Updated this week)
- Production infrastructure for machine learning at scale (★ 8,020 · Jun 12, 2024 · Updated last year)
- high-performance graph database for real-time use cases (★ 21,668 · May 5, 2026 · Updated last week)
- Parallel computing with task scheduling (★ 13,826 · Updated this week)
- 📚 Parameterize, execute, and analyze notebooks (★ 6,443 · Apr 6, 2026 · Updated last month)
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. (★ 22,329 · Updated this week)
- The Open Source Feature Store for AI/ML (★ 7,023 · Updated this week)
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and… (★ 10,861 · Updated this week)
- Fast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or … (★ 3,558 · Mar 24, 2026 · Updated last month)
- An orchestration platform for the development, production, and observation of data assets. (★ 15,469 · Updated this week)
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. (★ 42,442 · Updated this week)
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows (★ 45,293 · Updated this week)
- An open-source graph database (★ 15,042 · Updated this week)
- Kafka implemented in Golang with built-in coordination (No ZK dep, single binary install, Cloud Native) (★ 5,009 · Nov 13, 2023 · Updated 2 years ago)
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc… (★ 2,526 · Feb 21, 2024 · Updated 2 years ago)
- Glow is an easy-to-use distributed computation system written in Go, similar to Hadoop Map Reduce, Spark, Flink, Storm, etc. I am also wo… (★ 3,223 · Nov 2, 2018 · Updated 7 years ago)
- Easy and Repeatable Kubernetes Development (★ 15,821 · Updated this week)
- Always know what to expect from your data. (★ 11,467 · Updated this week)
- CockroachDB - the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placemen… (★ 32,121 · May 4, 2026 · Updated last week)
- Gorgonia is a library that helps facilitate machine learning in Go. (★ 5,914 · Aug 12, 2024 · Updated last year)
- OpenFaaS - Serverless Functions Made Simple (★ 26,156 · Apr 1, 2026 · Updated last month)
- Apache Superset is a Data Visualization and Data Exploration Platform (★ 72,769 · Updated this week)
- Open Source ML Model Versioning, Metadata, and Experiment Management (★ 1,746 · Jul 23, 2024 · Updated last year)
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data. (★ 28,559 · Updated this week)
- Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, conte… (★ 1,364 · Updated this week)
- The versioned, forkable, syncable database (★ 7,428 · Aug 27, 2021 · Updated 4 years ago)
- Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and mo… (★ 8,364 · May 4, 2026 · Updated last week)
- A next-generation curated knowledge sharing platform for data scientists and other technical professions. (★ 5,535 · Sep 4, 2024 · Updated last year)
- Fancy stream processing made operationally mundane (★ 8,657 · May 2, 2026 · Updated last week)
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more! (★ 8,624 · May 4, 2026 · Updated last week)