eth-easl / modyn
Modyn is a research platform for training ML models on growing datasets.
☆50 Updated 5 months ago
Alternatives and similar repositories for modyn
Users who are interested in modyn are comparing it to the libraries listed below.
- ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow). ☆39 Updated last year
- LLM Serving Performance Evaluation Harness ☆79 Updated 8 months ago
- A resilient distributed training framework ☆96 Updated last year
- VQPy: An object-oriented approach to modern video analytics ☆42 Updated last year
- Stateful LLM Serving ☆87 Updated 7 months ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… ☆161 Updated last month
- Multi-Instance-GPU profiling tool ☆60 Updated 2 years ago
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable ☆188 Updated last year
- Model-less Inference Serving ☆92 Updated last year
- Measure and optimize the energy consumption of your AI applications! ☆306 Updated this week
- ☆47 Updated last year
- ☆38 Updated 4 years ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23) ☆88 Updated 2 years ago
- ☆31 Updated 3 years ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances ☆130 Updated last year
- ☆74 Updated 2 weeks ago
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters. ☆41 Updated 2 years ago
- A framework for generating realistic LLM serving workloads ☆73 Updated 3 weeks ago
- ☆63 Updated last month
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances. ☆53 Updated 2 years ago
- Surrogate-based Hyperparameter Tuning System ☆27 Updated 2 years ago
- A library to analyze PyTorch traces. ☆419 Updated 2 weeks ago
- sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data ☆65 Updated last year
- An experimental parallel training platform ☆54 Updated last year
- Microsoft Collective Communication Library ☆66 Updated 11 months ago
- ☆65 Updated 9 months ago
- A Python library that transfers PyTorch tensors between CPU and NVMe ☆120 Updated 11 months ago
- Artifact of OSDI '24 paper, "Llumnix: Dynamic Scheduling for Large Language Model Serving" ☆62 Updated last year
- PyTorch implementation of paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline". ☆92 Updated 2 years ago
- 🔮 Execution time predictions for deep neural network training iterations across different GPUs. ☆62 Updated 2 years ago