[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆55Jan 22, 2026Updated last month
Alternatives and similar repositories for hipDNN
Users that are interested in hipDNN are comparing it to the libraries listed below
Sorting:
- benchmarking miopen☆17Jan 14, 2019Updated 7 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆148Jan 27, 2026Updated last month
- MIOpenGEMM is now deprecated☆61Jul 17, 2023Updated 2 years ago
- ☆14Mar 21, 2019Updated 6 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆51May 5, 2017Updated 8 years ago
- ROCm Machine Learning and HPC Stack installer☆29Jul 31, 2020Updated 5 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆139Feb 27, 2026Updated last week
- ☆12Dec 20, 2019Updated 6 years ago
- Yaksa: High-performance Noncontiguous Data Management☆15Oct 1, 2025Updated 5 months ago
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆12Jun 24, 2024Updated last year
- ☆24Jun 12, 2023Updated 2 years ago
- Cosmic Tagging Network for Neutrino Physics☆13Jun 26, 2024Updated last year
- ☆21Nov 10, 2020Updated 5 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆154Jan 21, 2026Updated last month
- Pragmatic, Productive, and Portable Affinity for HPC☆51Feb 26, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆390Feb 24, 2026Updated last week
- IMPORTANT NOTICE: This implementation is long outdated. Whole-Function Vectorization is an algorithm that transforms a scalar function in…☆22May 16, 2012Updated 13 years ago
- MPI Benchmark on AWS HPC cluster☆20Jan 31, 2020Updated 6 years ago
- ROCm OpenCL Compiler Tool Driver☆24Nov 22, 2019Updated 6 years ago
- ROCm Driver RDMA Peer to Peer Support☆23Mar 21, 2019Updated 6 years ago
- Provides the examples to write and build Habana custom kernels using the HabanaTools☆25Apr 15, 2025Updated 10 months ago
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆438Jun 10, 2020Updated 5 years ago
- Examples for HIP☆214Dec 5, 2024Updated last year
- ☆169Updated this week
- Compute applications.☆25Dec 12, 2019Updated 6 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆77Mar 27, 2023Updated 2 years ago
- (Deprecated) hipCaffe: the HIP port of Caffe☆124May 1, 2024Updated last year
- Create and deploy virtual-experiments - co-processing computational workflows☆10Jan 28, 2026Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆199Feb 26, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆411Feb 23, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆129Feb 26, 2026Updated last week
- An implementation of HIP that works on CPUs, across OSes.☆131Mar 19, 2024Updated last year
- OpenACC* to OpenMP* API assisting migration tool☆41Dec 15, 2025Updated 2 months ago
- LPGPU2 CodeXL power performance analysis and feedback tool for GPUs☆34Mar 14, 2019Updated 6 years ago
- This repository contains the results and code for the MLPerf™ Training v2.0 benchmark.☆29Feb 23, 2024Updated 2 years ago
- A C++ Library for Influence Maximization☆35Dec 10, 2024Updated last year
- PyTorch examples for NERSC systems☆34Oct 28, 2024Updated last year
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆96Feb 26, 2026Updated last week