mlsys-seo / ooo-backprop
☆25Updated 2 years ago
Alternatives and similar repositories for ooo-backprop:
Users that are interested in ooo-backprop are comparing it to the libraries listed below
- ☆102Updated last year
- [ACM EuroSys '23] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access☆57Updated last year
- ☆24Updated last year
- ☆64Updated last week
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆98Updated last month
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.☆38Updated 2 years ago
- Study Group of Deep Learning Compiler☆157Updated 2 years ago
- FriendliAI Model Hub☆92Updated 2 years ago
- ☆51Updated 4 months ago
- Welcome to PeriFlow CLI ☁︎☆12Updated last year
- ☆45Updated 6 months ago
- Microsoft Collective Communication Library☆64Updated 4 months ago
- ☆24Updated 6 years ago
- A resilient distributed training framework☆93Updated 11 months ago
- Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.☆61Updated 2 weeks ago
- [ATC '24] Metis: Fast automatic distributed training on heterogeneous GPUs (https://www.usenix.org/conference/atc24/presentation/um)☆25Updated 4 months ago
- Synthesizer for optimal collective communication algorithms☆105Updated 11 months ago
- ☆12Updated 6 months ago
- ☆72Updated 3 years ago
- Network Contention-Aware Cluster Scheduling with Reinforcement Learning (IEEE ICPADS 2023)☆16Updated 5 months ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆72Updated last year
- ☆77Updated 2 years ago
- ☆43Updated 11 months ago
- FastFlow is a system that automatically detects CPU bottlenecks in deep learning training pipelines and resolves the bottlenecks with dat…☆26Updated last year
- Thunder Research Group's Collective Communication Library☆34Updated 11 months ago
- Experimental deep learning framework written in Rust☆14Updated 2 years ago
- ☆16Updated 2 years ago
- Multi-Instance-GPU profiling tool☆57Updated last year
- Research and development for optimizing transformers☆125Updated 4 years ago
- ☆47Updated 3 months ago