Data-Centric Pipelines and Data Versioning
★6,291 · Feb 3, 2025 · Updated last year
Alternatives and similar repositories for pachyderm
Users that are interested in pachyderm are comparing it to the libraries listed below.
- Machine Learning Toolkit for Kubernetes · ★15,725 · Jun 11, 2026 · Updated last week
- 🦉 Data Versioning and ML Experiments · ★15,675 · Jun 8, 2026 · Updated last week
- AI Infra / AI Orchestration / AI Control Plane · ★3,707 · Updated this week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models · ★4,752 · Mar 23, 2026 · Updated 2 months ago
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… · ★18,743 · Jun 13, 2026 · Updated last week
- Build, Manage and Deploy AI/ML Systems · ★10,133 · Updated this week
- Workflow Engine for Kubernetes · ★16,773 · Updated this week
- High-Performance Serverless event and data processing platform · ★5,730 · Updated this week
- Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows. · ★7,088 · Jun 12, 2026 · Updated last week
- The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, a… · ★26,638 · Updated this week
- Production infrastructure for machine learning at scale · ★8,013 · Jun 12, 2024 · Updated 2 years ago
- high-performance graph database for real-time use cases · ★21,700 · Updated this week
- Parallel computing with task scheduling · ★13,846 · Jun 11, 2026 · Updated last week
- Parameterize, execute, and analyze notebooks · ★6,450 · May 12, 2026 · Updated last month
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. · ★22,650 · Updated this week
- The Open Source Feature Store for AI/ML · ★7,095 · Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and… · ★10,887 · Jun 12, 2026 · Updated last week
- Fast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or … · ★3,562 · May 8, 2026 · Updated last month
- An orchestration platform for the development, production, and observation of data assets. · ★15,699 · Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. · ★42,931 · Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows · ★45,867 · Updated this week
- An open-source graph database · ★15,043 · May 5, 2026 · Updated last month
- Kafka implemented in Golang with built-in coordination (No ZK dep, single binary install, Cloud Native) · ★5,010 · May 20, 2026 · Updated last month
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc… · ★2,526 · Feb 21, 2024 · Updated 2 years ago
- Glow is an easy-to-use distributed computation system written in Go, similar to Hadoop Map Reduce, Spark, Flink, Storm, etc. I am also wo… · ★3,219 · Nov 2, 2018 · Updated 7 years ago
- Easy and Repeatable Kubernetes Development · ★15,845 · Jun 9, 2026 · Updated last week
- Always know what to expect from your data. · ★11,556 · Jun 13, 2026 · Updated last week
- CockroachDB - the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placemen… · ★32,207 · Updated this week
- Gorgonia is a library that helps facilitate machine learning in Go. · ★5,917 · Aug 12, 2024 · Updated last year
- OpenFaaS - Serverless Functions Made Simple · ★26,174 · Apr 1, 2026 · Updated 2 months ago
- Apache Superset is a Data Visualization and Data Exploration Platform · ★73,298 · Updated this week
- Open Source ML Model Versioning, Metadata, and Experiment Management · ★1,746 · Jul 23, 2024 · Updated last year
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data. · ★28,648 · Jun 14, 2026 · Updated last week
- Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, conte… · ★1,366 · Updated this week
- The versioned, forkable, syncable database · ★7,422 · Aug 27, 2021 · Updated 4 years ago
- Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and mo… · ★8,393 · May 4, 2026 · Updated last month
- A next-generation curated knowledge sharing platform for data scientists and other technical professions. · ★5,532 · Sep 4, 2024 · Updated last year
- Fancy stream processing made operationally mundane · ★8,682 · Updated this week
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more! · ★8,676 · Jun 3, 2026 · Updated 2 weeks ago