pachyderm / pachydermLinks
Data-Centric Pipelines and Data Versioning
☆6,227Updated 3 months ago
Alternatives and similar repositories for pachyderm
Users that are interested in pachyderm are comparing it to the libraries listed below
Sorting:
- Machine Learning Toolkit for Kubernetes☆14,998Updated last month
- High-Performance Serverless event and data processing platform☆5,463Updated this week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,304Updated 2 weeks ago
- Build, Manage and Deploy AI/ML Systems☆8,838Updated last week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,531Updated this week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,641Updated last month
- 📚 Parameterize, execute, and analyze notebooks☆6,171Updated last month
- An open-source graph database☆14,923Updated 2 months ago
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,721Updated 10 months ago
- PipelineAI☆4,171Updated last year
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.☆6,251Updated this week
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.☆5,520Updated 8 months ago
- Quilt is a data mesh for connecting people with actionable data☆1,342Updated last week
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,522Updated last year
- the portable Python dataframe library☆5,784Updated this week
- The Open Source Feature Store for AI/ML☆6,108Updated this week
- Production infrastructure for machine learning at scale☆8,037Updated 11 months ago
- M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platform☆4,827Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,580Updated 2 weeks ago
- Parallel computing with task scheduling☆13,237Updated last week
- Spinnaker is an open source, multi-cloud continuous delivery platform for releasing software changes with high velocity and confidence.☆9,477Updated last week
- NumPy and Pandas interface to Big Data☆3,199Updated last year
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,371Updated 2 months ago
- 🦉 Data Versioning and ML Experiments☆14,494Updated this week
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,744Updated 3 years ago
- Grumpy is a Python to Go source code transcompiler and runtime.☆10,540Updated 3 years ago
- A tool for developers to create cloud-native applications on Kubernetes.☆3,913Updated 11 months ago
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow☆2,080Updated last year
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆15,481Updated this week
- Kubernetes Native Serverless Framework☆6,866Updated 3 years ago