google-research / ml-for-systems-taxonomy
โ18Updated 4 years ago
Alternatives and similar repositories for ml-for-systems-taxonomy:
Users that are interested in ml-for-systems-taxonomy are comparing it to the libraries listed below
- โ47Updated 2 years ago
- ๐ฎ Execution time predictions for deep neural network training iterations across different GPUs.โ60Updated 2 years ago
- An IR for efficiently simulating distributed ML computation.โ28Updated last year
- Cavs: An Efficient Runtime System for Dynamic Neural Networksโ14Updated 4 years ago
- An Attention Superoptimizerโ21Updated 3 months ago
- FTPipe and related pipeline model parallelism research.โ41Updated last year
- An experimental parallel training platformโ54Updated last year
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets forโฆโ136Updated this week
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)โ15Updated 10 months ago
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloadsโ43Updated 4 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launchesโ15Updated 5 years ago
- โ24Updated last year
- A schedule language for large model trainingโ146Updated 10 months ago
- Set of datasets for the deep learning recommendation model (DLRM).โ45Updated 2 years ago
- ParaDnn: A systematic performance analysis methodology for deep learning.โ39Updated 5 years ago
- Dynamic resources changes for multi-dimensional parallelism trainingโ25Updated 5 months ago
- ACT An Architectural Carbon Modeling Tool for Designing Sustainable Computer Systemsโ39Updated last week
- How much energy do GenAI models consume?โ42Updated 6 months ago
- A Framework for Reasoning about System Performance using Causal AIโ42Updated 3 years ago
- LLM Inference analyzer for different hardware platformsโ62Updated 2 weeks ago
- โ24Updated last year
- Architecture-level Fault Injection Tool for GPU Application Resilience Evaluationโ58Updated last year
- Issues related to MLPerfโข Inference policies, including rules and suggested changesโ62Updated last month
- Modified version of PyTorch able to work with changes to GPGPU-Simโ52Updated 2 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapoโ18Updated 2 years ago
- An external memory allocator example for PyTorch.โ14Updated 3 years ago
- A resilient distributed training frameworkโ94Updated last year
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusionโ32Updated 11 months ago
- โ16Updated 2 years ago
- โ14Updated 2 years ago