mlcommons / hpc
Reference implementations of MLPerf™ HPC training benchmarks
☆42Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for hpc
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆20Updated 9 months ago
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 2 years ago
- RCCL Performance Benchmark Tests☆51Updated last month
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆43Updated last month
- ROC_SHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆41Updated last year
- Very-Low Overhead Checkpointing System☆54Updated last month
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆15Updated 2 years ago
- pytorch ucc plugin☆17Updated 3 years ago
- Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner☆19Updated 6 months ago
- A Micro-benchmarking Tool for HPC Networks☆22Updated 3 weeks ago
- HPCG benchmark based on ROCm platform☆35Updated this week
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆20Updated 6 years ago
- MPI benchmark to test and measure collective performance☆49Updated 3 years ago
- ☆17Updated 10 months ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆72Updated 8 months ago
- ☆12Updated 8 months ago
- High-performance, GPU-aware communication library☆84Updated last month
- Intel HPC Containers using Singularity☆19Updated last year
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆60Updated 6 years ago
- OpenMP vs Offload☆21Updated last year
- ☆41Updated 4 years ago
- Advanced Profiling and Analytics for AMD Hardware☆138Updated this week
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆30Updated 3 weeks ago
- Drishti provides I/O insights to help you improve your application's I/O performance.☆19Updated 3 weeks ago
- Pytorch process group third-party plugin for UCC☆20Updated 7 months ago
- ☆36Updated 5 months ago
- Benchmarks☆15Updated last month
- An I/O benchmark for deep Learning applications☆70Updated 3 weeks ago
- A tracing infrastructure for heterogeneous computing applications.☆25Updated last week
- ☆10Updated 3 months ago