baidu-research / DeepBench
Benchmarking Deep Learning operations on different hardware
☆1,077Updated 3 years ago
Alternatives and similar repositories for DeepBench:
Users that are interested in DeepBench are comparing it to the libraries listed below
- ☆1,656Updated 6 years ago
- Acceleration package for neural networks on multi-core CPUs☆1,681Updated 7 months ago
- Fast Recurrent Networks Library☆579Updated 8 years ago
- Easy benchmarking of all publicly accessible implementations of convnets☆2,682Updated 7 years ago
- Low-precision matrix multiplication☆1,787Updated 11 months ago
- Reference implementations of MLPerf™ training benchmarks☆1,636Updated this week
- A domain specific language to express machine learning workloads.☆1,759Updated last year
- Compute Library for Deep Neural Networks (clDNN)☆574Updated 2 years ago
- A benchmark framework for Tensorflow☆1,148Updated last year
- nGraph has moved to OpenVINO☆1,350Updated 4 years ago
- ☆573Updated 6 years ago
- Collective communications library with various primitives for multi-machine training.☆1,253Updated 2 weeks ago
- Original Python version of Intel® Nervana™ Graph☆215Updated 2 years ago
- Caffe: a fast open framework for deep learning.☆674Updated last year
- This is a Experimental version of OpenCL by AMD Research, we now recommend you to use The official BVLC Caffe OpenCL branch is over at …☆519Updated 6 years ago
- Open single and half precision gemm implementations☆373Updated last year
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,030Updated last year
- This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® X…☆846Updated 2 years ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,533Updated 5 years ago
- Automatically exported from code.google.com/p/cuda-convnet2☆785Updated 9 years ago
- ImageNet classification using binary Convolutional Neural Networks☆857Updated 7 years ago
- Assembler for NVIDIA Maxwell architecture☆963Updated 2 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆296Updated 6 years ago
- THE Deep Learning Benchmarks☆351Updated 8 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆1,111Updated 5 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆265Updated last year
- ☆375Updated 7 years ago
- A common bricks library for building scalable and portable distributed machine learning.☆868Updated 7 months ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆610Updated 4 years ago
- oneAPI Deep Neural Network Library (oneDNN)☆3,677Updated this week