ROCm / hipCaffeLinks
(Deprecated) hipCaffe: the HIP port of Caffe
☆124Updated last year
Alternatives and similar repositories for hipCaffe
Users that are interested in hipCaffe are comparing it to the libraries listed below
Sorting:
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Updated 8 years ago
- Easy to run kernels using OpenCL☆187Updated 6 months ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 8 years ago
- The repo is obsolete. Use at your own risk.☆12Updated 7 years ago
- Compute Library for Deep Neural Networks (clDNN)☆576Updated 2 years ago
- MIOpenGEMM is now deprecated☆61Updated 2 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆182Updated 2 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆298Updated 6 years ago
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆436Updated 5 years ago
- An OpenCL backend for torch.☆300Updated 8 years ago
- The NNEF Tools repository contains tools to generate and consume NNEF documents☆228Updated this week
- AMD OpenVX Core -- a sub-module of amdovx-modules:☆147Updated 6 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆29Updated 5 years ago
- OpenCL Torch☆147Updated 6 years ago
- OpenCL support for TensorFlow☆477Updated 7 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 6 years ago
- OpenCL support for TensorFlow via SYCL☆65Updated 7 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆77Updated 4 years ago
- Original Python version of Intel® Nervana™ Graph☆215Updated 3 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆31Updated 8 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 8 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated last year
- Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.☆172Updated last week
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation☆325Updated 2 years ago
- GPUOCelot: A dynamic compilation framework for PTX☆289Updated 2 years ago
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices☆866Updated 6 months ago
- A CUDNN minimal deep learning training code sample using LeNet.☆269Updated 2 years ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆108Updated 2 years ago
- TensorFlow-nGraph bridge☆136Updated 4 years ago
- Open single and half precision gemm implementations☆392Updated 2 years ago