ROCm / hipDNN
A thin wrapper around miOpen and cuDNN
☆42Updated last year
Alternatives and similar repositories for hipDNN
Users that are interested in hipDNN are comparing it to the libraries listed below
Sorting:
- MIOpenGEMM is now deprecated☆62Updated last year
- Reusable software components for ROCm developers☆83Updated this week
- ROCm Parallel Primitives☆171Updated last week
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated 11 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- OpenCL/SPIR-V implementation of HIP☆104Updated 2 years ago
- ROCm BLAS marshalling library☆140Updated this week
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆83Updated 2 weeks ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆67Updated last year
- ROCm Device Libraries☆97Updated last year
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆108Updated last week
- An implementation of HIP that works on CPUs, across OSes.☆116Updated last year
- RAND library for HIP programming language☆118Updated last week
- SYCL Benchmark Suite☆64Updated 2 months ago
- The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA…☆87Updated 4 years ago
- Implementation of AMD HIP for CPUs☆22Updated 4 years ago
- Stretching GPU performance for GEMMs and tensor contractions.☆237Updated this week
- Next generation FFT implementation for ROCm☆191Updated last week
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated 2 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- hipFFT is a FFT marshalling library.☆63Updated last week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆144Updated 2 weeks ago
- Examples for HIP☆206Updated 5 months ago
- Compute applications.☆24Updated 5 years ago
- Next generation SPARSE implementation for ROCm platform☆122Updated last week
- SYCL Open Source Specification☆134Updated this week
- AMD’s C++ library for accelerating tensor primitives☆40Updated this week
- ROCm SPARSE marshalling library☆67Updated this week
- Next generation LAPACK implementation for ROCm platform☆100Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago