ROCm / aws-ofi-rcclView external linksLinks
☆17Nov 11, 2025Updated 3 months ago
Alternatives and similar repositories for aws-ofi-rccl
Users that are interested in aws-ofi-rccl are comparing it to the libraries listed below
Sorting:
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆18Jul 9, 2025Updated 7 months ago
- ext_mpi_collectives☆11Apr 1, 2025Updated 10 months ago
- Build tools for Open-CE☆13Nov 13, 2025Updated 3 months ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11May 11, 2018Updated 7 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 2 years ago
- High-Performance Linpack Benchmark adopted version for GPU backend☆12Sep 12, 2022Updated 3 years ago
- Scripts to build AMD ROCm from source.☆16Oct 31, 2024Updated last year
- ☆16Nov 19, 2025Updated 2 months ago
- Repo for climate deep learning codes☆16May 21, 2019Updated 6 years ago
- HPCG benchmark based on ROCm platform☆39Feb 3, 2026Updated last week
- Benchmarks☆17Apr 28, 2025Updated 9 months ago
- Time Ordered Astrophysics Scalable Tools☆44Updated this week
- OpenCL porting of the GROMACS molecular simulation toolkit☆27Sep 5, 2015Updated 10 years ago
- ☆18Jan 17, 2024Updated 2 years ago
- Scripts for building libraries with Cray's PE☆21Aug 31, 2021Updated 4 years ago
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Sep 30, 2025Updated 4 months ago
- ☆24Oct 9, 2025Updated 4 months ago
- Reference implementations of MLPerf™ HPC training benchmarks☆49Feb 25, 2025Updated 11 months ago
- ☆60Updated this week
- single-GPU to multi-GPU training of PyTorch apps at NERSC☆22Apr 10, 2024Updated last year
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆57Updated this week
- ☆23Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆25Updated this week
- Slurm Spank plugins for Quantum resources and jobs support☆47Updated this week
- NAS Parallel Benchmarks for evaluating GPU and APIs☆29Sep 29, 2025Updated 4 months ago
- A C++based implementation of the TeaLeaf heat conduction mini-app. This implementation of TeaLeaf replicates the functionality of the ref…☆25Aug 11, 2024Updated last year
- MAD (Model Automation and Dashboarding)☆31Feb 6, 2026Updated last week
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆66Dec 10, 2025Updated 2 months ago
- Competition instructions for the Center for High Performance Computing (CHPC) 2024 Student Cluster Compettion (SCC). Which is hosted by t…☆15Jan 30, 2026Updated 2 weeks ago
- E4S Spack environments and container recipes☆27Sep 20, 2025Updated 4 months ago
- Compute applications.☆25Dec 12, 2019Updated 6 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆144Updated this week
- HiCMA: Hierarchical Computations on Manycore Architectures☆34Mar 19, 2023Updated 2 years ago
- Sources for the Oak Ridge Leadership Computing Facility User Documentation☆66Updated this week
- NGC Container Replicator☆28Dec 26, 2022Updated 3 years ago
- ROCm Machine Learning and HPC Stack installer☆29Jul 31, 2020Updated 5 years ago
- Scripts for running various benchmarks on Isambard and other systems.☆29May 13, 2021Updated 4 years ago
- Create and deploy virtual-experiments - co-processing computational workflows☆10Jan 28, 2026Updated 2 weeks ago