pachyderm / pachydermLinks
Data-Centric Pipelines and Data Versioning
☆6,277Updated 11 months ago
Alternatives and similar repositories for pachyderm
Users that are interested in pachyderm are comparing it to the libraries listed below
Sorting:
- Machine Learning Toolkit for Kubernetes☆15,387Updated this week
- High-Performance Serverless event and data processing platform☆5,666Updated this week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,691Updated this week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,704Updated this week
- Quilt is a data mesh for connecting people with actionable data☆1,357Updated this week
- 📚 Parameterize, execute, and analyze notebooks☆6,349Updated this week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,616Updated this week
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,527Updated last year
- An open-source graph database☆15,016Updated last month
- 🦉 Data Versioning and ML Experiments☆15,249Updated this week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,502Updated 2 months ago
- M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platform☆4,879Updated last month
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,744Updated last year
- Parallel computing with task scheduling☆13,695Updated this week
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.☆5,538Updated last year
- A GPU-powered real-time analytics storage and query engine.☆3,077Updated last year
- A low-latency prediction-serving system☆1,421Updated 4 years ago
- Build, Manage and Deploy AI/ML Systems☆9,702Updated this week
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow☆2,085Updated 2 years ago
- Production infrastructure for machine learning at scale☆8,029Updated last year
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.☆6,663Updated this week
- Beaker Extensions for Jupyter Notebook☆2,836Updated 2 years ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,871Updated last week
- The Open Source Feature Store for AI/ML☆6,603Updated this week
- An orchestration platform for the development, production, and observation of data assets.☆14,717Updated this week
- ♾️ CML - Continuous Machine Learning | CI/CD for ML☆4,161Updated 7 months ago
- Service orchestration and management tool.☆6,039Updated last month
- The versioned, forkable, syncable database☆7,441Updated 4 years ago
- lakeFS - Data version control for your data lake | Git for data☆5,076Updated this week
- A system for quickly generating training data with weak supervision☆5,933Updated last year