mlcommons / training_results_v3.0Links
This repository contains the results and code for the MLPerf™ Training v3.0 benchmark.
☆12Updated 2 years ago
Alternatives and similar repositories for training_results_v3.0
Users that are interested in training_results_v3.0 are comparing it to the libraries listed below
Sorting:
- This repository contains the results and code for the MLPerf™ Inference v4.0 benchmark.☆10Updated 6 months ago
- oneCCL Bindings for Pytorch* (deprecated)☆104Updated last month
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆66Updated last week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆203Updated last week
- oneAPI Collective Communications Library (oneCCL)☆254Updated last week
- A tool for bandwidth measurements on NVIDIA GPUs.☆617Updated 9 months ago
- Python bindings for NVTX☆67Updated 2 years ago
- ☆61Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆144Updated this week
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆135Updated 5 years ago
- Reference implementations of MLPerf™ HPC training benchmarks☆49Updated 11 months ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆41Updated last year
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆380Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆410Updated last week
- Experimental projects related to TensorRT☆118Updated last week
- Assembler for NVIDIA Volta and Turing GPUs☆239Updated 4 years ago
- CloudAI Benchmark Framework☆82Updated this week
- Bandwidth test for ROCm☆75Updated last week
- This repository contains the results and code for the MLPerf™ Training v2.0 benchmark.☆29Updated last year
- Convert nvprof profiles into about:tracing compatible JSON files☆73Updated 4 years ago
- RDMA and SHARP plugins for nccl library☆221Updated 3 weeks ago
- Training material for Nsight developer tools☆178Updated last year
- ☆29Updated 4 months ago
- Ahead of Time (AOT) Triton Math Library☆88Updated last week
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆255Updated this week
- Microsoft Collective Communication Library☆381Updated 2 years ago
- OpenAI Triton backend for Intel® GPUs☆226Updated this week
- A library to analyze PyTorch traces.☆462Updated this week
- Synthesizer for optimal collective communication algorithms☆124Updated last year
- NCCL Profiling Kit☆150Updated last year