ROCm / hipCaffe
(Deprecated) hipCaffe: the HIP port of Caffe
☆124 · Updated last year
Alternatives and similar repositories for hipCaffe
Users who are interested in hipCaffe are comparing it to the libraries listed below.
- CL Offline Compiler: Compile OpenCL kernels to HSAIL ☆50 · Updated 8 years ago
- The repo is obsolete. Use at your own risk. ☆12 · Updated 7 years ago
- MIOpenGEMM is now deprecated ☆61 · Updated 2 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆137 · Updated 8 years ago
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform ☆436 · Updated 5 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner ☆182 · Updated 3 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory ☆299 · Updated 7 years ago
- Easy to run kernels using OpenCL ☆187 · Updated 8 months ago
- OpenCL support for TensorFlow via SYCL ☆65 · Updated 7 years ago
- Compute Library for Deep Neural Networks (clDNN) ☆575 · Updated 2 years ago
- OpenCL Torch ☆146 · Updated 7 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision ☆64 · Updated 6 years ago
- portDNN is a library implementing neural network algorithms written using SYCL ☆113 · Updated last year
- GPUOCelot: A dynamic compilation framework for PTX ☆288 · Updated 2 years ago
- AMD OpenVX Core -- a sub-module of amdovx-modules: ☆147 · Updated 6 years ago
- Library for fast image convolution in neural networks on Intel Architecture ☆30 · Updated 8 years ago
- OpenCL support for TensorFlow ☆477 · Updated 8 years ago
- Caffe: a fast open framework for deep learning. With OpenCL and CUDA support. ☆86 · Updated 7 years ago
- Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN. ☆173 · Updated this week
- The NNEF Tools repository contains tools to generate and consume NNEF documents ☆230 · Updated last week
- An OpenCL backend for torch. ☆300 · Updated 9 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm) ☆47 · Updated 8 years ago
- Open single and half precision gemm implementations ☆394 · Updated 2 years ago
- Code appendix to an OpenCL matrix-multiplication tutorial ☆178 · Updated 8 years ago
- A CUDNN minimal deep learning training code sample using LeNet. ☆268 · Updated 2 years ago
- OpenCL port of TensorFlow using SYCL, generic instructions for building are here: ☆61 · Updated 5 years ago
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation ☆325 · Updated 2 years ago
- Original Python version of Intel® Nervana™ Graph ☆215 · Updated 3 years ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn… ☆108 · Updated 2 years ago
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices ☆869 · Updated 8 months ago