Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL
☆137 · Apr 20, 2017 · Updated 8 years ago
Alternatives and similar repositories for libdnn
Users who are interested in libdnn are comparing it to the libraries listed below
- experimental port of nervana neon kernels in OpenCL ☆11 · Jul 24, 2016 · Updated 9 years ago
- OpenCL version of caffe ☆18 · Dec 17, 2015 · Updated 10 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆56 · Oct 8, 2017 · Updated 8 years ago
- Open single and half precision gemm implementations ☆397 · Apr 2, 2023 · Updated 2 years ago
- JOCLBlast - Java bindings for CLBlast ☆15 · Mar 14, 2021 · Updated 5 years ago
- Torch bindings for FFmpeg (reading videos only) ☆26 · Jul 13, 2016 · Updated 9 years ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases. ☆294 · Nov 22, 2021 · Updated 4 years ago
- Train neural networks to automate your home ☆19 · Mar 1, 2023 · Updated 3 years ago
- Caffe with NNPACK integration ☆59 · Mar 24, 2016 · Updated 9 years ago
- Acceleration package for neural networks on multi-core CPUs ☆1,702 · Jun 11, 2024 · Updated last year
- Caffe: a fast open framework for deep learning. With OpenCL and CUDA support. ☆87 · Aug 18, 2018 · Updated 7 years ago
- PyTorch bindings for openai-gemm ☆20 · Feb 6, 2017 · Updated 9 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks. ☆627 · Feb 9, 2026 · Updated last month
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets ☆28 · Apr 22, 2016 · Updated 9 years ago
- profiling gemm on android ☆10 · Apr 1, 2016 · Updated 9 years ago
- ☆22 · Jun 22, 2016 · Updated 9 years ago
- Tuned OpenCL BLAS ☆1,168 · Feb 1, 2026 · Updated last month
- Julia wrapper of CLBlast, a "tuned OpenCL BLAS library". ☆14 · Aug 23, 2023 · Updated 2 years ago
- Torch FFI-bindings for NNPACK ☆31 · May 26, 2017 · Updated 8 years ago
- Low-precision matrix multiplication ☆1,832 · Jan 29, 2024 · Updated 2 years ago
- OpenCL support for TensorFlow ☆475 · Oct 26, 2017 · Updated 8 years ago
- C++ interface for mxnet ☆115 · Mar 22, 2017 · Updated 9 years ago
- fast and energy-efficient computation of HOG features ☆17 · Oct 3, 2018 · Updated 7 years ago
- This is an experimental version of OpenCL by AMD Research. We now recommend you use the official BVLC Caffe OpenCL branch, which is over at … ☆531 · Aug 31, 2018 · Updated 7 years ago
- a fully-differentiable graphical raytracer ☆15 · Jul 29, 2015 · Updated 10 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision ☆64 · Aug 15, 2019 · Updated 6 years ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib ☆59 · Apr 10, 2023 · Updated 2 years ago
- OpenCL library to train deep convolutional neural networks ☆880 · Jan 5, 2018 · Updated 8 years ago
- tutorial to optimize GEMM performance on android ☆51 · Feb 17, 2016 · Updated 10 years ago
- ☆119 · Dec 20, 2017 · Updated 8 years ago
- Collective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Ca… ☆196 · Nov 6, 2019 · Updated 6 years ago
- auto-tuning momentum SGD optimizer ☆23 · Jul 14, 2017 · Updated 8 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture ☆137 · May 8, 2017 · Updated 8 years ago
- Library for fast image convolution in neural networks on Intel Architecture ☆30 · Jun 25, 2017 · Updated 8 years ago
- Reproduction of MobileNetV2 using MXNet ☆128 · Mar 15, 2019 · Updated 7 years ago
- Pose estimation of a 2D picture, given a 3D bundler output ☆25 · May 18, 2015 · Updated 10 years ago
- ☆33 · May 17, 2016 · Updated 9 years ago
- a C++ wrapper of Caffe and mxnet to make predictions ☆49 · Mar 7, 2018 · Updated 8 years ago
- An OpenCL backend for torch. ☆302 · Nov 16, 2016 · Updated 9 years ago