pachyderm / pachyderm
Data-Centric Pipelines and Data Versioning
☆6,181 · Updated this week
Related projects
Alternatives and complementary repositories for pachyderm
- Machine Learning Toolkit for Kubernetes ☆14,399 · Updated this week
- High-Performance Serverless event and data processing platform ☆5,319 · Updated this week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… ☆17,883 · Updated last week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle ☆3,571 · Updated this week
- Production infrastructure for machine learning at scale ☆8,021 · Updated 5 months ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models ☆4,386 · Updated this week
- Open Source Platform for developing, scaling and deploying serious ML, AI, and data science systems ☆8,256 · Updated this week
- Workflow Engine for Kubernetes ☆15,099 · Updated this week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks. ☆5,787 · Updated this week
- 🦉 Data Versioning and ML Experiments ☆13,927 · Updated this week
- Parallel computing with task scheduling ☆12,604 · Updated this week
- Kubernetes Native Serverless Framework ☆6,862 · Updated 2 years ago
- A next-generation curated knowledge sharing platform for data scientists and other technical professions. ☆5,482 · Updated 2 months ago
- the portable Python dataframe library ☆5,318 · Updated this week
- 📚 Parameterize, execute, and analyze notebooks ☆5,977 · Updated last month
- The Open Source Feature Store for Machine Learning ☆5,613 · Updated this week
- PipelineAI ☆4,172 · Updated 7 months ago
- Fast and Simple Serverless Functions for Kubernetes ☆8,424 · Updated this week
- An open-source graph database ☆14,859 · Updated 4 months ago
- Quilt is a data mesh for connecting people with actionable data ☆1,330 · Updated this week
- Easy and Repeatable Kubernetes Development ☆15,058 · Updated this week
- Always know what to expect from your data. ☆9,997 · Updated this week
- Cadence is a distributed, scalable, durable, and highly available orchestration engine to execute asynchronous long-running business logi… ☆8,319 · Updated this week
- [Project ended] rkt is a pod-native container engine for Linux. It is composable, secure, and built on standards. ☆8,822 · Updated 4 years ago
- Build powerful pipelines in any programming language. ☆5,200 · Updated last year
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting… ☆4,440 · Updated last week
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow ☆2,742 · Updated 3 years ago
- Spinnaker is an open source, multi-cloud continuous delivery platform for releasing software changes with high velocity and confidence. ☆9,344 · Updated this week
- The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data. ☆5,809 · Updated this week
- Ready-to-run Docker images containing Jupyter applications ☆8,004 · Updated this week