baidu-research / DeepBench
Benchmarking Deep Learning operations on different hardware
☆1,100 Updated 4 years ago
Alternatives and similar repositories for DeepBench
Users that are interested in DeepBench are comparing it to the libraries listed below
- Acceleration package for neural networks on multi-core CPUs ☆1,701 Updated last year
- ☆1,655 Updated 7 years ago
- Reference implementations of MLPerf® training benchmarks ☆1,722 Updated last week
- Low-precision matrix multiplication ☆1,817 Updated last year
- nGraph has moved to OpenVINO ☆1,343 Updated 5 years ago
- Open single and half precision gemm implementations ☆394 Updated 2 years ago
- A domain specific language to express machine learning workloads. ☆1,760 Updated 2 years ago
- Easy benchmarking of all publicly accessible implementations of convnets ☆2,691 Updated 8 years ago
- ☆600 Updated 7 years ago
- Automatically exported from code.google.com/p/cuda-convnet2 ☆812 Updated 9 years ago
- Fast Recurrent Networks Library ☆577 Updated 9 years ago
- Collective communications library with various primitives for multi-machine training. ☆1,367 Updated 3 weeks ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory ☆299 Updated 6 years ago
- ☆370 Updated 8 years ago
- This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® X… ☆851 Updated 3 years ago
- Benchmarking State-of-the-Art Deep Learning Software Tools ☆171 Updated 8 years ago
- A benchmark framework for Tensorflow ☆1,147 Updated 2 years ago
- This is an experimental version of OpenCL by AMD Research. We now recommend the official BVLC Caffe OpenCL branch, which is over at … ☆529 Updated 7 years ago
- Original Python version of Intel® Nervana™ Graph ☆215 Updated 3 years ago
- A CUDNN minimal deep learning training code sample using LeNet. ☆269 Updated 2 years ago
- ImageNet classification using binary Convolutional Neural Networks ☆868 Updated 7 years ago
- Compute Library for Deep Neural Networks (clDNN) ☆575 Updated 2 years ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution ☆1,061 Updated 2 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks. ☆624 Updated 5 years ago
- ATen: A TENsor library for C++11 ☆708 Updated 5 years ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators ☆1,547 Updated 6 years ago
- Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning ☆1,117 Updated 6 years ago
- Deep Learning Benchmark for comparing the performance of DL frameworks, GPUs, and single vs half precision ☆428 Updated 5 years ago
- Caffe: a fast open framework for deep learning. ☆668 Updated 2 years ago
- Documentation for StreamExecutor open source proposal ☆83 Updated 9 years ago