ROCm / hipCaffe
(Deprecated) hipCaffe: the HIP port of Caffe
☆124Updated 9 months ago
Alternatives and similar repositories for hipCaffe:
Users that are interested in hipCaffe are comparing it to the libraries listed below
- The repo is obsolete. Use at your own risk.☆12Updated 6 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 7 years ago
- Easy to run kernels using OpenCL☆184Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- MIOpenGEMM is now deprecated☆62Updated last year
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆434Updated 4 years ago
- OpenCL Torch☆147Updated 6 years ago
- An OpenCL backend for torch.☆290Updated 8 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆110Updated 8 months ago
- OpenCL support for TensorFlow via SYCL☆65Updated 6 years ago
- an OpenCL based software library containing random number generation functions☆135Updated 3 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆173Updated 2 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆296Updated 6 years ago
- The NNEF Tools repository contains tools to generate and consume NNEF documents☆223Updated this week
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆28Updated 5 years ago
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated this week
- a software library containing Sparse functions written in OpenCL☆173Updated 4 years ago
- Next generation BLAS implementation for ROCm platform☆360Updated this week
- A CUDNN minimal deep learning training code sample using LeNet.☆264Updated last year
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 7 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture☆134Updated 7 years ago
- AMD OpenVX Core -- a sub-module of amdovx-modules:☆149Updated 6 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆76Updated 4 years ago
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation☆323Updated last year
- ROCm Device Libraries☆97Updated 9 months ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆63Updated 5 years ago
- AMD's graph optimization engine.☆208Updated this week
- This is a Experimental version of OpenCL by AMD Research, we now recommend you to use The official BVLC Caffe OpenCL branch is over at …☆521Updated 6 years ago
- The SHOC Benchmark Suite☆249Updated 3 years ago