mlsys-seo / ooo-backprop
☆25Updated 2 years ago
Alternatives and similar repositories for ooo-backprop:
Users that are interested in ooo-backprop are comparing it to the libraries listed below
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆80Updated 3 weeks ago
- ☆102Updated last year
- Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)☆55Updated 10 months ago
- ☆24Updated last year
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.☆37Updated 2 years ago
- Study Group of Deep Learning Compiler☆156Updated 2 years ago
- FriendliAI Model Hub☆89Updated 2 years ago
- ☆47Updated 2 months ago
- Network Contention-Aware Cluster Scheduling with Reinforcement Learning (IEEE ICPADS 2023)☆15Updated 3 months ago
- ☆13Updated 3 weeks ago
- FTPipe and related pipeline model parallelism research.☆41Updated last year
- ☆25Updated 6 years ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆9Updated 10 months ago
- ☆53Updated 4 years ago
- ☆64Updated 2 months ago
- one-shot-tuner☆8Updated 2 years ago
- [ATC '24] Metis: Fast automatic distributed training on heterogeneous GPUs (https://www.usenix.org/conference/atc24/presentation/um)☆24Updated 2 months ago
- QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference☆114Updated 10 months ago
- ☆70Updated 3 years ago
- "JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)☆12Updated last week
- ☆12Updated 4 months ago
- ☆73Updated 2 years ago
- FastFlow is a system that automatically detects CPU bottlenecks in deep learning training pipelines and resolves the bottlenecks with dat…☆26Updated last year
- A performance library for machine learning applications.☆183Updated last year
- ☆48Updated 9 months ago
- A resilient distributed training framework☆88Updated 9 months ago
- ☆43Updated 4 months ago
- Multi-Instance-GPU profiling tool☆56Updated last year
- Set of datasets for the deep learning recommendation model (DLRM).☆41Updated 2 years ago
- PyTorch-UVM on super-large language models.☆14Updated 4 years ago