ROCm / hipDNN
A thin wrapper around MIOpen and cuDNN
☆38 Updated last year
Related projects
Alternatives and complementary repositories for hipDNN
- MIOpenGEMM is now deprecated ☆61 Updated last year
- RAND library for HIP programming language ☆111 Updated this week
- Reusable software components for ROCm developers ☆79 Updated this week
- ☆42 Updated 9 months ago
- HIP back-end for Thrust that has been replaced by rocThrust ☆28 Updated last year
- portDNN is a library implementing neural network algorithms written using SYCL ☆108 Updated 6 months ago
- ROCm Parallel Primitives ☆162 Updated this week
- ROCm Device Libraries ☆98 Updated 6 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs ☆75 Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆99 Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics. ☆130 Updated this week
- ROCm BLAS marshalling library ☆118 Updated this week
- Next generation FFT implementation for ROCm ☆176 Updated this week
- OpenCL/SPIR-V implementation of HIP ☆104 Updated 2 years ago
- Bandwidth test for ROCm ☆47 Updated 2 weeks ago
- The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA… ☆85 Updated 4 years ago
- ROCm SPARSE marshalling library ☆69 Updated this week
- hipFFT is an FFT marshalling library. ☆54 Updated this week
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL. ☆64 Updated 4 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆99 Updated 7 years ago
- ☆35 Updated 9 months ago
- CMake modules used within the ROCm libraries ☆58 Updated this week
- Compute applications. ☆25 Updated 4 years ago
- Next generation SPARSE implementation for ROCm platform ☆116 Updated this week
- ☆75 Updated last year
- Stretching GPU performance for GEMMs and tensor contractions. ☆223 Updated this week
- Next generation LAPACK implementation for ROCm platform ☆94 Updated this week
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi… ☆66 Updated 9 months ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆43 Updated 10 months ago
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high… ☆66 Updated this week