google-research / ml-for-systems-taxonomyLinks
☆19Updated 4 years ago
Alternatives and similar repositories for ml-for-systems-taxonomy
Users that are interested in ml-for-systems-taxonomy are comparing it to the libraries listed below
Sorting:
- ☆47Updated 2 years ago
- ParaDnn: A systematic performance analysis methodology for deep learning.☆39Updated 5 years ago
- Architecture-level Fault Injection Tool for GPU Application Resilience Evaluation☆73Updated last year
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆64Updated 3 weeks ago
- A Framework for Reasoning about System Performance using Causal AI☆43Updated 3 years ago
- How much energy do GenAI models consume?☆47Updated 4 months ago
- AI and Memory Wall☆218Updated last year
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)☆18Updated last year
- 🔮 Execution time predictions for deep neural network training iterations across different GPUs.☆63Updated 2 years ago
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion☆32Updated last year
- An IR for efficiently simulating distributed ML computation.☆29Updated last year
- A schedule language for large model training☆151Updated last month
- ☆42Updated 2 years ago
- Simple Distributed Deep Learning on TensorFlow☆134Updated 3 months ago
- Benchmarks to capture important workloads.☆31Updated 8 months ago
- An experimental parallel training platform☆54Updated last year
- An analytical performance modeling tool for deep neural networks.☆91Updated 5 years ago
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads☆44Updated 4 years ago
- ☆40Updated 4 years ago
- ☆70Updated 4 years ago
- Issues related to MLPerf™ training policies, including rules and suggested changes☆95Updated 2 weeks ago
- ☆14Updated 2 years ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆151Updated last week
- This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).☆14Updated 4 years ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆22Updated 4 years ago
- A runtime fault injection tool for PyTorch☆120Updated last year
- ☆25Updated 2 years ago
- Model-less Inference Serving☆92Updated last year
- Set of datasets for the deep learning recommendation model (DLRM).☆47Updated 2 years ago
- ACT An Architectural Carbon Modeling Tool for Designing Sustainable Computer Systems☆43Updated 2 months ago