NVIDIA / mlperf-common
NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions
☆25Updated last week
Alternatives and similar repositories for mlperf-common:
Users that are interested in mlperf-common are comparing it to the libraries listed below
- oneCCL Bindings for Pytorch*☆89Updated last week
- MLPerf™ logging library☆33Updated last week
- Benchmarks to capture important workloads.☆29Updated last month
- Reference implementations of MLPerf™ HPC training benchmarks☆47Updated 3 weeks ago
- Bandwidth test for ROCm☆54Updated last week
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆76Updated last year
- RCCL Performance Benchmark Tests☆60Updated last week
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆60Updated 3 weeks ago
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 2 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆63Updated 3 years ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆106Updated this week
- This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.☆56Updated last year
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆79Updated this week
- General policies for MLPerf™ including submission rules, coding standards, etc.☆28Updated last month
- MPI accelerator-integrated communication extensions☆32Updated last year
- ROCm BLAS marshalling library☆133Updated this week
- pytorch ucc plugin☆19Updated 3 years ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆62Updated last week
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆30Updated 3 months ago
- A task benchmark☆41Updated 7 months ago
- CUDA Templates for Linear Algebra Subroutines☆15Updated last week
- ☆24Updated this week
- oneAPI Collective Communications Library (oneCCL)☆224Updated 2 weeks ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆132Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆137Updated this week
- ROCm SPARSE marshalling library☆67Updated last week
- HPCG benchmark based on ROCm platform☆37Updated last week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆51Updated 3 weeks ago
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆34Updated this week
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆21Updated last year