AMD lab notes with code examples to demonstrate use of AMD GPUs
☆110Jun 28, 2024Updated last year
Alternatives and similar repositories for amd-lab-notes
Users that are interested in amd-lab-notes are comparing it to the libraries listed below
Sorting:
- ☆38Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆144Updated this week
- ☆16Nov 19, 2025Updated 3 months ago
- HIP backend patch for Numba, the NumPy aware dynamic Python compiler using LLVM.☆18Feb 16, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆139Updated this week
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Sep 30, 2025Updated 5 months ago
- HPCG benchmark based on ROCm platform☆39Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆67Dec 10, 2025Updated 2 months ago
- ☆71Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆165Feb 16, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror☆521Updated this week
- ☆23Feb 17, 2026Updated last week
- ☆12Aug 4, 2025Updated 6 months ago
- ExaWorks SDK☆11Feb 1, 2024Updated 2 years ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Updated this week
- A Micro-benchmarking Tool for HPC Networks☆34Sep 2, 2025Updated 5 months ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆127Nov 14, 2025Updated 3 months ago
- Example codes for ATPESC☆14Jul 31, 2025Updated 7 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆154Jan 21, 2026Updated last month
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 2 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- A Benchmark Toolkit for Assembly Instructions Using the LLVM JIT☆17Oct 26, 2020Updated 5 years ago
- Scalable GPU Kernel Fission/Fusion Transformation for Memory-Bound Kernels☆14Aug 26, 2015Updated 10 years ago
- High-Performance Linpack Benchmark adopted version for GPU backend☆12Sep 12, 2022Updated 3 years ago
- Scripts to build AMD ROCm from source.☆16Oct 31, 2024Updated last year
- ☆17Nov 11, 2025Updated 3 months ago
- Repo for a DOE HPC workflow training event☆13Apr 28, 2023Updated 2 years ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆18Jul 9, 2025Updated 7 months ago
- Department of Energy Standard Utility Library☆33Jan 30, 2026Updated last month
- This repository contains the results and code for the MLPerf™ Training v2.0 benchmark.☆29Feb 23, 2024Updated 2 years ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆19Jul 13, 2025Updated 7 months ago
- Tangent space normal map blending via quaternion rotation☆30May 21, 2017Updated 8 years ago
- A collection of examples for the ROCm software stack☆279Updated this week
- Debug print operator for cudagraph debugging☆14Aug 2, 2024Updated last year
- A GPU performance prediction toolkit for CUDA programs☆18Mar 25, 2019Updated 6 years ago
- hosted by HPC System Test Working Group collaboration☆17Feb 17, 2026Updated last week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆94Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆255Feb 10, 2026Updated 2 weeks ago