ctuning / ck-nntest
CK-NNTest: collaboratively validating, benchmarking and optimizing neural net operators across platforms, frameworks and datasets
☆15 Updated 4 years ago
Alternatives and similar repositories for ck-nntest
Users who are interested in ck-nntest are comparing it to the libraries listed below.
- tutorial to optimize GEMM performance on android ☆51 Updated 9 years ago
- Optimizing Mobile Deep Learning on ARM GPU with TVM ☆182 Updated 7 years ago
- Heterogeneous Run Time version of MXNet. Added heterogeneous capabilities to the MXNet, uses heterogeneous computing infrastructure frame… ☆72 Updated 8 years ago
- ☆37 Updated 8 years ago
- Neural Style Transfer with Caffe2 on your Android phone ☆82 Updated 6 years ago
- A quick view of high-performance convolution neural networks (CNNs) inference engines on mobile devices. ☆151 Updated 3 years ago
- Efficient forward propagation for BCNNs ☆49 Updated 8 years ago
- Collective Knowledge repository for NVIDIA's TensorRT ☆37 Updated 4 years ago
- A script to convert floating-point CNN models into generalized low-precision ShiftCNN representation ☆57 Updated 8 years ago
- The benchmark of ncnn that is a high-performance neural network inference framework optimized for the mobile platform ☆72 Updated 6 years ago
- This is a caffe implementation of ShuffleNet model ☆15 Updated 7 years ago
- Simple pruning example using Caffe ☆33 Updated 8 years ago
- This is a really simple compression of Caffe Model ☆24 Updated 8 years ago
- Top-1 Acc=61.0% on ImageNet, without any sacrificing compared with SqueezeNet v1.1. ☆22 Updated 8 years ago
- PyTorch -> ONNX -> TVM for autotuning ☆24 Updated 5 years ago
- Caffe implementation of ICCV 2017 & TPAMI 2018 paper - ThiNet ☆46 Updated 7 years ago
- Related Paper of Efficient Deep Neural Networks ☆86 Updated 4 years ago
- Computation using data flow graphs for scalable machine learning ☆25 Updated 8 years ago
- ☆209 Updated 7 years ago
- ☆62 Updated 7 years ago
- Caffe: a fast open framework for deep learning. ☆14 Updated 9 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm) ☆47 Updated 8 years ago
- Benchmark of TVM quantized model on CUDA ☆112 Updated 5 years ago
- nnvm&tvm example of cross compilation and deployment in Nvidia Jetson TX2 platform ☆11 Updated 7 years ago
- ☆18 Updated 8 years ago
- Hopefully fast implementation of XNOR-Net in C, because, why not? ☆27 Updated 8 years ago
- flexible-gemm conv of deepcore ☆17 Updated 6 years ago
- Implement vgg16 model by ARM Compute Library ☆32 Updated 6 years ago
- ☆20 Updated 2 years ago
- The quantization of CNN/LSTM ☆11 Updated 8 years ago