Data-Centric Pipelines and Data Versioning
β6,297Feb 3, 2025Updated last year
Alternatives and similar repositories for pachyderm
Users that are interested in pachyderm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Machine Learning Toolkit for Kubernetesβ15,586Jan 5, 2026Updated 3 months ago
- π¦ Data Versioning and ML Experimentsβ15,551Apr 14, 2026Updated last week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycleβ3,703Updated this week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning modelsβ4,743Mar 23, 2026Updated 3 weeks ago
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visβ¦β18,703Apr 10, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Build, Manage and Deploy AI/ML Systemsβ10,040Updated this week
- Workflow Engine for Kubernetesβ16,620Updated this week
- High-Performance Serverless event and data processing platformβ5,699Apr 14, 2026Updated last week
- Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows. Flyte 2 now available locally: httpsβ¦β6,941Updated this week
- The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, aβ¦β25,431Updated this week
- Production infrastructure for machine learning at scaleβ8,021Jun 12, 2024Updated last year
- high-performance graph database for real-time use casesβ21,665Updated this week
- Parallel computing with task schedulingβ13,804Apr 13, 2026Updated last week
- π Parameterize, execute, and analyze notebooksβ6,431Apr 6, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β22,201Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering andβ¦β10,834Updated this week
- The Open Source Feature Store for AI/MLβ6,970Updated this week
- Fast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or β¦β3,560Mar 24, 2026Updated 3 weeks ago
- An orchestration platform for the development, production, and observation of data assets.β15,348Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.β42,165Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflowsβ45,062Updated this week
- An open-source graph databaseβ15,042Nov 22, 2025Updated 5 months ago
- Kafka implemented in Golang with built-in coordination (No ZK dep, single binary install, Cloud Native)β5,012Nov 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Docβ¦β2,527Feb 21, 2024Updated 2 years ago
- Glow is an easy-to-use distributed computation system written in Go, similar to Hadoop Map Reduce, Spark, Flink, Storm, etc. I am also woβ¦β3,223Nov 2, 2018Updated 7 years ago
- Easy and Repeatable Kubernetes Developmentβ15,807Updated this week
- Always know what to expect from your data.β11,422Updated this week
- CockroachDB β the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placemenβ¦β32,072Updated this week
- Gorgonia is a library that helps facilitate machine learning in Go.β5,913Aug 12, 2024Updated last year
- OpenFaaS - Serverless Functions Made Simpleβ26,142Apr 1, 2026Updated 2 weeks ago
- Apache Superset is a Data Visualization and Data Exploration Platformβ72,481Updated this week
- Open Source ML Model Versioning, Metadata, and Experiment Managementβ1,746Jul 23, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.β28,473Apr 13, 2026Updated last week
- Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, conteβ¦β1,363Updated this week
- The versioned, forkable, syncable databaseβ7,428Aug 27, 2021Updated 4 years ago
- Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and moβ¦β8,356Apr 15, 2026Updated last week
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.β5,539Sep 4, 2024Updated last year
- Fancy stream processing made operationally mundaneβ8,643Updated this week
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!β8,580Apr 13, 2026Updated last week