ROCm / hipCaffe
(Deprecated) hipCaffe: the HIP port of Caffe
☆124Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for hipCaffe
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- The repo is obsolete. Use at your own risk.☆12Updated 6 years ago
- MIOpenGEMM is now deprecated☆61Updated last year
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 7 years ago
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆433Updated 4 years ago
- Easy to run kernels using OpenCL☆183Updated 6 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆170Updated last year
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- An OpenCL backend for torch.☆290Updated 8 years ago
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆88Updated 9 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆291Updated 5 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆99Updated 7 years ago
- a software library containing Sparse functions written in OpenCL☆173Updated 4 years ago
- Next generation BLAS implementation for ROCm platform☆346Updated this week
- A thin wrapper around miOpen and cuDNN☆38Updated last year
- OpenCL support for TensorFlow via SYCL☆65Updated 6 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- an OpenCL based software library containing random number generation functions☆133Updated 3 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture☆134Updated 7 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆108Updated 6 months ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆219Updated this week
- The SHOC Benchmark Suite☆247Updated 2 years ago
- OpenCL Torch☆147Updated 6 years ago
- Stretching GPU performance for GEMMs and tensor contractions.☆223Updated this week
- ☆67Updated 2 years ago
- Open single and half precision gemm implementations☆374Updated last year
- AMD OpenVX Core -- a sub-module of amdovx-modules:☆149Updated 5 years ago