pachyderm / pachydermLinks
Data-Centric Pipelines and Data Versioning
☆6,249Updated 6 months ago
Alternatives and similar repositories for pachyderm
Users that are interested in pachyderm are comparing it to the libraries listed below
Sorting:
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,668Updated last week
- High-Performance Serverless event and data processing platform☆5,561Updated this week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,608Updated last week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.☆6,433Updated last week
- 📚 Parameterize, execute, and analyze notebooks☆6,248Updated last month
- Machine Learning Toolkit for Kubernetes☆15,147Updated 2 weeks ago
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,737Updated last year
- 🦉 Data Versioning and ML Experiments☆14,794Updated last week
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.☆5,525Updated 11 months ago
- Production infrastructure for machine learning at scale☆8,032Updated last year
- A GPU-powered real-time analytics storage and query engine.☆3,065Updated last year
- Beaker Extensions for Jupyter Notebook☆2,830Updated last year
- lakeFS - Data version control for your data lake | Git for data☆4,842Updated this week
- PipelineAI☆4,171Updated last year
- Run your code in the cloud, with technology so advanced, it feels like magic!☆2,644Updated last week
- M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platform☆4,842Updated this week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,435Updated last week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,448Updated 3 months ago
- Build, Manage and Deploy AI/ML Systems☆9,432Updated this week
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,520Updated last year
- Kafka implemented in Golang with built-in coordination (No ZK dep, single binary install, Cloud Native)☆4,994Updated last year
- An orchestration platform for the development, production, and observation of data assets.☆13,875Updated last week
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,852Updated 2 weeks ago
- Cadence is a distributed, scalable, durable, and highly available orchestration engine to execute asynchronous long-running business logi…☆8,828Updated this week
- Build powerful pipelines in any programming language.☆5,233Updated 2 years ago
- An open-source graph database☆14,957Updated 2 months ago
- Lore makes machine learning approachable for Software Engineers and maintainable for Machine Learning Researchers☆1,549Updated 2 years ago
- Parallel computing with task scheduling☆13,428Updated last week
- The leader in Customer Data Infrastructure☆6,952Updated 2 months ago
- s3git: git for Cloud Storage. Distributed Version Control for Data. Create decentralized and versioned repos that scale infinitely to 100…☆1,462Updated 9 years ago