mlcommons / training_results_v2.1
This repository contains the results and code for the MLPerf™ Training v2.1 benchmark.
☆15 · Updated last year
Related projects
Alternatives and complementary repositories for training_results_v2.1
- This repository contains the results and code for the MLPerf™ Training v2.0 benchmark. ☆27 · Updated 8 months ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark. ☆37 · Updated 8 months ago
- RCCL Performance Benchmark Tests ☆49 · Updated 2 weeks ago
- MLPerf™ logging library ☆30 · Updated last week
- oneCCL Bindings for Pytorch* ☆86 · Updated 2 weeks ago
- Issues related to MLPerf™ training policies, including rules and suggested changes ☆93 · Updated last month
- ☆55 · Updated 5 months ago
- Issues related to MLPerf™ Inference policies, including rules and suggested changes ☆57 · Updated last week
- ☆48 · Updated 8 months ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large … ☆63 · Updated 2 years ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note… ☆57 · Updated 2 months ago
- Training neural networks in TensorFlow 2.0 with 5x less memory ☆128 · Updated 2 years ago
- Benchmarks to capture important workloads. ☆28 · Updated 5 months ago
- An IR for efficiently simulating distributed ML computation. ☆25 · Updated 10 months ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch ☆19 · Updated last week
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for… ☆122 · Updated this week
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021 ☆54 · Updated 3 years ago
- ☆26 · Updated 3 years ago
- A parallel framework for training deep neural networks ☆43 · Updated last week
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters. ☆34 · Updated 2 years ago
- A Python library that transfers PyTorch tensors between CPU and NVMe ☆96 · Updated this week
- ☆23 · Updated 9 months ago
- Cavs: An Efficient Runtime System for Dynamic Neural Networks ☆13 · Updated 4 years ago
- Python bindings for NVTX ☆66 · Updated last year
- Reference models for Intel(R) Gaudi(R) AI Accelerator ☆155 · Updated this week
- This repository contains the results and code for the MLPerf™ Training v3.0 benchmark. ☆12 · Updated last year
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API ☆65 · Updated last year
- FTPipe and related pipeline model parallelism research. ☆41 · Updated last year
- Memory Optimizations for Deep Learning (ICML 2023) ☆59 · Updated 7 months ago
- CUDA 12.2 HMM demos ☆17 · Updated 3 months ago