pachyderm / pachydermLinks
Data-Centric Pipelines and Data Versioning
☆6,278Updated 10 months ago
Alternatives and similar repositories for pachyderm
Users that are interested in pachyderm are comparing it to the libraries listed below
Sorting:
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,688Updated this week
- High-Performance Serverless event and data processing platform☆5,641Updated last week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,703Updated this week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.☆6,654Updated this week
- Machine Learning Toolkit for Kubernetes☆15,378Updated last week
- 📚 Parameterize, execute, and analyze notebooks☆6,345Updated last month
- Quilt is a data mesh for connecting people with actionable data☆1,356Updated last week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,497Updated 2 months ago
- Production infrastructure for machine learning at scale☆8,031Updated last year
- ♾️ CML - Continuous Machine Learning | CI/CD for ML☆4,161Updated 6 months ago
- Build, Manage and Deploy AI/ML Systems☆9,686Updated last week
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,743Updated last year
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆16,318Updated this week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,611Updated 7 months ago
- the portable Python dataframe library☆6,312Updated this week
- Parallel computing with task scheduling☆13,679Updated this week
- 🦉 Data Versioning and ML Experiments☆15,229Updated this week
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,526Updated last year
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow☆2,085Updated 2 years ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,870Updated last week
- M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platform☆4,878Updated last month
- A GPU-powered real-time analytics storage and query engine.☆3,074Updated last year
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.☆5,538Updated last year
- The Open Source Feature Store for AI/ML☆6,561Updated last week
- Lore makes machine learning approachable for Software Engineers and maintainable for Machine Learning Researchers☆1,547Updated 2 years ago
- Beaker Extensions for Jupyter Notebook☆2,836Updated 2 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,464Updated 2 months ago
- Always know what to expect from your data.☆11,039Updated last week
- An open-source graph database☆15,009Updated last month
- Build powerful pipelines in any programming language.☆5,232Updated 2 years ago