Data-Centric Pipelines and Data Versioning
β6,293Feb 3, 2025Updated last year
Alternatives and similar repositories for pachyderm
Users that are interested in pachyderm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Machine Learning Toolkit for Kubernetesβ15,709May 24, 2026Updated 2 weeks ago
- π¦ Data Versioning and ML Experimentsβ15,662Updated this week
- Open Source AI Infra & Engineering Control Planeβ3,707May 29, 2026Updated 2 weeks ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning modelsβ4,751Mar 23, 2026Updated 2 months ago
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visβ¦β18,738May 19, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Build, Manage and Deploy AI/ML Systemsβ10,114Jun 3, 2026Updated last week
- Workflow Engine for Kubernetesβ16,744Jun 5, 2026Updated last week
- High-Performance Serverless event and data processing platformβ5,729Updated this week
- Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.β7,075Updated this week
- The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, aβ¦β26,338Jun 5, 2026Updated last week
- Production infrastructure for machine learning at scaleβ8,014Jun 12, 2024Updated 2 years ago
- high-performance graph database for real-time use casesβ21,681Jun 4, 2026Updated last week
- Parallel computing with task schedulingβ13,849Updated this week
- π Parameterize, execute, and analyze notebooksβ6,449May 12, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β22,560Updated this week
- The Open Source Feature Store for AI/MLβ7,085Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering andβ¦β10,882Updated this week
- Fast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or β¦β3,562May 8, 2026Updated last month
- An orchestration platform for the development, production, and observation of data assets.β15,647Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.β42,789Jun 6, 2026Updated last week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflowsβ45,710Jun 6, 2026Updated last week
- An open-source graph databaseβ15,043May 5, 2026Updated last month
- Kafka implemented in Golang with built-in coordination (No ZK dep, single binary install, Cloud Native)β5,011May 20, 2026Updated 3 weeks ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Docβ¦β2,526Feb 21, 2024Updated 2 years ago
- Glow is an easy-to-use distributed computation system written in Go, similar to Hadoop Map Reduce, Spark, Flink, Storm, etc. I am also woβ¦β3,219Nov 2, 2018Updated 7 years ago
- Easy and Repeatable Kubernetes Developmentβ15,834Jun 5, 2026Updated last week
- Always know what to expect from your data.β11,548Jun 3, 2026Updated last week
- CockroachDB β the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placemenβ¦β32,200Updated this week
- Gorgonia is a library that helps facilitate machine learning in Go.β5,915Aug 12, 2024Updated last year
- OpenFaaS - Serverless Functions Made Simpleβ26,177Apr 1, 2026Updated 2 months ago
- Apache Superset is a Data Visualization and Data Exploration Platformβ73,212Updated this week
- Open Source ML Model Versioning, Metadata, and Experiment Managementβ1,746Jul 23, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.β28,621Jun 1, 2026Updated last week
- Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, conteβ¦β1,364Updated this week
- The versioned, forkable, syncable databaseβ7,421Aug 27, 2021Updated 4 years ago
- Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and moβ¦β8,385May 4, 2026Updated last month
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.β5,532Sep 4, 2024Updated last year
- Fancy stream processing made operationally mundaneβ8,679Updated this week
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!β8,670Jun 3, 2026Updated last week