google-research / ml-for-systems-taxonomyLinks
☆19Updated 4 years ago
Alternatives and similar repositories for ml-for-systems-taxonomy
Users that are interested in ml-for-systems-taxonomy are comparing it to the libraries listed below
Sorting:
- ☆47Updated 2 years ago
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆63Updated last month
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads☆44Updated 4 years ago
- An IR for efficiently simulating distributed ML computation.☆29Updated last year
- A schedule language for large model training☆149Updated last week
- ☆40Updated 4 years ago
- Architecture-level Fault Injection Tool for GPU Application Resilience Evaluation☆69Updated last year
- ParaDnn: A systematic performance analysis methodology for deep learning.☆39Updated 5 years ago
- An experimental parallel training platform☆54Updated last year
- ☆70Updated 4 years ago
- Simple Distributed Deep Learning on TensorFlow☆133Updated 2 months ago
- http://vlsiarch.eecs.harvard.edu/research/recommendation/☆136Updated 2 years ago
- Issues related to MLPerf™ training policies, including rules and suggested changes☆95Updated last week
- Set of datasets for the deep learning recommendation model (DLRM).☆47Updated 2 years ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆149Updated last week
- ☆43Updated last year
- AI and Memory Wall☆219Updated last year
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…☆28Updated 3 years ago
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion☆32Updated last year
- Slides from 2021-12-15 talk, "TVM Developer Bootcamp – Writing Hardware Backends"☆10Updated 3 years ago
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)☆16Updated last year
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 6 years ago
- Model-less Inference Serving☆91Updated last year
- ☆14Updated 2 years ago
- How much energy do GenAI models consume?☆47Updated 3 months ago
- An analytical performance modeling tool for deep neural networks.☆89Updated 4 years ago
- A Framework for Reasoning about System Performance using Causal AI☆42Updated 3 years ago
- Microsoft Collective Communication Library☆67Updated 9 months ago
- FTPipe and related pipeline model parallelism research.☆41Updated 2 years ago
- ☆24Updated 2 years ago