google-research / ml-for-systems-taxonomyLinks
☆19Updated 4 years ago
Alternatives and similar repositories for ml-for-systems-taxonomy
Users that are interested in ml-for-systems-taxonomy are comparing it to the libraries listed below
Sorting:
- ☆47Updated 2 years ago
- ParaDnn: A systematic performance analysis methodology for deep learning.☆39Updated 5 years ago
- ☆43Updated last year
- An IR for efficiently simulating distributed ML computation.☆28Updated last year
- FTPipe and related pipeline model parallelism research.☆41Updated 2 years ago
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆14Updated 4 years ago
- 🔮 Execution time predictions for deep neural network training iterations across different GPUs.☆62Updated 2 years ago
- An Attention Superoptimizer☆21Updated 4 months ago
- Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020☆128Updated 10 months ago
- An experimental parallel training platform☆54Updated last year
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆141Updated this week
- ACT An Architectural Carbon Modeling Tool for Designing Sustainable Computer Systems☆40Updated 3 weeks ago
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads☆44Updated 4 years ago
- ☆40Updated 4 years ago
- Deadline-based hyperparameter tuning on RayTune.☆31Updated 5 years ago
- ☆15Updated 2 years ago
- Machine learning on serverless platform☆9Updated 2 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆34Updated 2 years ago
- ☆70Updated 3 years ago
- ☆12Updated 5 years ago
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks.☆17Updated 5 years ago
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021☆56Updated 3 years ago
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…☆29Updated 3 years ago
- An analytical performance modeling tool for deep neural networks.☆89Updated 4 years ago
- Synthesizer for optimal collective communication algorithms☆107Updated last year
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 5 years ago
- How much energy do GenAI models consume?☆42Updated 3 weeks ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆18Updated 2 years ago
- A schedule language for large model training☆148Updated 11 months ago
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆62Updated 3 months ago