benoitsteiner / tensorflow-xsmmLinks
Improved performance for TensorFlow on Intel hardware.
☆13Updated 7 years ago
Alternatives and similar repositories for tensorflow-xsmm
Users that are interested in tensorflow-xsmm are comparing it to the libraries listed below
Sorting:
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆108Updated 2 years ago
- A Raspberry Pi GPU-accelerated implementation of the GEMM matrix-multiply function☆88Updated 11 years ago
- ☆10Updated 3 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆31Updated 8 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆111Updated 7 years ago
- OpenCL backend for Torch nn neural networks library☆126Updated 9 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆141Updated 8 years ago
- Original Python version of Intel® Nervana™ Graph☆215Updated 2 years ago
- An Architecture-level Fault Injection Tool for GPU Application Resilience Evaluations☆18Updated 5 years ago
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated last year
- Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends☆182Updated 6 years ago
- Intu is a Cognitive Embodiment Middleware for AI on the edge.☆31Updated 9 months ago
- A GPU (CUDA) based Artificial Neural Network library☆109Updated 4 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Updated 8 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 6 years ago
- Caffe deep learning framework - optimized for Xeon Phi☆14Updated 10 years ago
- Implements a message passing interface (MPI) wrapper that makes it easy to do massively parallel computations inside the Torch deep-learn…☆110Updated 6 years ago
- Benchmarking Keras application network performance☆52Updated 6 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆21Updated 7 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆298Updated 6 years ago
- A fast deep neural network library (CPU) for speech recognition☆84Updated 6 years ago
- Cairo lua bindings with extensions for torch☆15Updated 9 years ago
- Catamount is a compute graph analysis tool to load, construct, and modify deep learning models and to symbolically analyze their compute …☆14Updated 4 years ago
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso…☆68Updated 8 years ago
- Scientific library for high-precision computations and research☆49Updated 7 years ago
- Code examples for CUDA and OpenACC☆34Updated last year
- Caffe: a fast open framework for deep learning. With OpenCL and CUDA support.☆86Updated 7 years ago
- A GPU / CPU implementation of a feed forward neural network☆31Updated 10 years ago
- ArrayFire's Machine Learning Library.☆105Updated 6 years ago
- TensorFlow-nGraph bridge☆136Updated 4 years ago