rocmarchive / realcaffe2
The repo is obsolete. Use at your own risk.
☆12Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for realcaffe2
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 6 months ago
- MIOpenGEMM is now deprecated☆61Updated last year
- A thin wrapper around miOpen and cuDNN☆38Updated last year
- ROCm OpenCL Compiler Tool Driver☆24Updated 4 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 7 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆108Updated 6 months ago
- ROCm Device Libraries☆98Updated 6 months ago
- ROCm Parallel Primitives☆162Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆223Updated this week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆99Updated 7 years ago
- RAND library for HIP programming language☆111Updated this week
- Python bindings for NVTX☆66Updated last year
- The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA…☆85Updated 4 years ago
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆88Updated 9 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆69Updated 8 years ago
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated last year
- CLTune: An automatic OpenCL & CUDA kernel tuner☆170Updated last year
- Next generation FFT implementation for ROCm☆176Updated this week
- ROCm BLAS marshalling library☆118Updated this week
- HCC Sample Applications☆13Updated 7 years ago
- Intel® GPU Compute Samples☆97Updated this week
- Collective Knowledge repository for NVIDIA's TensorRT☆37Updated 3 years ago
- ☆14Updated 5 years ago
- The SHOC Benchmark Suite☆247Updated 2 years ago
- Tools and extensions for CUDA profiling☆63Updated 4 years ago