rocmarchive / realcaffe2
The repo is obsolete. Use at your own risk.
☆12Updated 6 years ago
Alternatives and similar repositories for realcaffe2:
Users that are interested in realcaffe2 are comparing it to the libraries listed below
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 10 months ago
- MIOpenGEMM is now deprecated☆62Updated last year
- portDNN is a library implementing neural network algorithms written using SYCL☆111Updated 10 months ago
- A thin wrapper around miOpen and cuDNN☆41Updated last year
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- RAND library for HIP programming language☆117Updated this week
- ROCm OpenCL Compiler Tool Driver☆24Updated 5 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 7 years ago
- ROCm Parallel Primitives☆170Updated this week
- CLTune: An automatic OpenCL & CUDA kernel tuner☆177Updated 2 years ago
- ROCm Device Libraries☆97Updated 10 months ago
- Multi-GPU Computing Benchmark Suite (CUDA)☆42Updated 7 years ago
- Tools and extensions for CUDA profiling☆65Updated 5 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆105Updated 7 years ago
- The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA…☆86Updated 4 years ago
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated last year
- HCC Sample Applications☆13Updated 8 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- ☆68Updated 2 years ago
- Next generation FFT implementation for ROCm☆188Updated this week
- CUDA GDB☆199Updated last month
- Intel® GPU Compute Samples☆105Updated 2 weeks ago
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated last week
- Full-speed Array of Structures access☆164Updated last year
- The SHOC Benchmark Suite☆250Updated 3 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 7 years ago
- ROC profiler library. Profiling with perf-counters and derived metrics.☆137Updated last week
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.☆22Updated 4 years ago
- Python bindings for NVTX☆66Updated last year