pachyderm / pachydermLinks
Data-Centric Pipelines and Data Versioning
β6,234Updated 4 months ago
Alternatives and similar repositories for pachyderm
Users that are interested in pachyderm are comparing it to the libraries listed below
Sorting:
- π Parameterize, execute, and analyze notebooksβ6,197Updated 2 months ago
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visβ¦β18,343Updated last month
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycleβ3,645Updated last week
- Machine Learning Toolkit for Kubernetesβ15,047Updated 2 weeks ago
- π¦ Data Versioning and ML Experimentsβ14,563Updated this week
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.β5,522Updated 9 months ago
- An open-source graph databaseβ14,935Updated 2 months ago
- High-Performance Serverless event and data processing platformβ5,482Updated last week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning modelsβ4,554Updated this week
- Build, Manage and Deploy AI/ML Systemsβ8,888Updated this week
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrowβ2,746Updated 3 years ago
- Spinnaker is an open source, multi-cloud continuous delivery platform for releasing software changes with high velocity and confidence.β9,493Updated this week
- NumPy and Pandas interface to Big Dataβ3,200Updated last year
- Quilt is a data mesh for connecting people with actionable dataβ1,342Updated last week
- Parallel computing with task schedulingβ13,277Updated this week
- Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduβ¦β8,571Updated 8 months ago
- Build powerful pipelines in any programming language.β5,224Updated last year
- M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platformβ4,828Updated this week
- Production infrastructure for machine learning at scaleβ8,036Updated last year
- the portable Python dataframe libraryβ5,849Updated this week
- βΎοΈ CML - Continuous Machine Learning | CI/CD for MLβ4,107Updated 2 weeks ago
- Machine Learning Pipelines for Kubeflowβ3,861Updated this week
- [UNMAINTAINED] A next generation open source platform as a service (PaaS)β7,846Updated 3 years ago
- Apache Pinot - A realtime distributed OLAP datastoreβ5,809Updated this week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.β6,297Updated this week
- lakeFS - Data version control for your data lake | Git for dataβ4,722Updated this week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadminβ6,390Updated 3 months ago
- Open Source ML Model Versioning, Metadata, and Experiment Managementβ1,723Updated 10 months ago
- [Project ended] rkt is a pod-native container engine for Linux. It is composable, secure, and built on standards.β8,808Updated 5 years ago
- Cadence is a distributed, scalable, durable, and highly available orchestration engine to execute asynchronous long-running business logiβ¦β8,719Updated last week