strin / mocha-gemm-profile
profiling gemm on android
☆10Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for mocha-gemm-profile
- tutorial to optimize GEMM performance on android☆51Updated 8 years ago
- a C++ wrapper of Caffe and mxnet to make predictions☆50Updated 6 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Updated 7 years ago
- Caffe with NNPACK integration☆59Updated 8 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆29Updated 8 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- ☆24Updated 6 years ago
- detection-developing☆20Updated 10 years ago
- Tools to convert Caffe models to neon's serialization format☆39Updated last year
- a mxnet multi-task tutorial☆33Updated 8 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆63Updated 5 years ago
- Ristretto: Caffe-based approximation of convolutional neural networks.☆30Updated 6 years ago
- RenderScript based implementation of Convolutional Neural Networks for Android phones☆52Updated 6 years ago
- Face detection evaluation☆61Updated 7 years ago
- Collective Knowledge repository for NVIDIA's TensorRT☆37Updated 3 years ago
- ☆23Updated 8 years ago
- Caffe: a fast open framework for deep learning.☆14Updated 8 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 7 years ago
- Heterogeneous Run Time version of MXNet. Added heterogeneous capabilities to the MXNet, uses heterogeneous computing infrastructure frame…☆72Updated 6 years ago
- Torch FFI-bindings for NNPACK☆30Updated 7 years ago
- Proof-of-Concept CNN in Halide☆21Updated 8 years ago
- Caffe: a fast open framework for deep learning. With OpenCL and CUDA support.☆86Updated 6 years ago
- Simple MXNet sequence-to-sequence model (neural machine translation)☆24Updated 6 years ago
- Darwin: A Framework for Machine Learning Research and Development☆54Updated 3 years ago
- CNN(Convolutional neural network) forward code which requires little dependency(Opencv, TBB-optional) and is easy to run on Windows(using…☆34Updated 8 years ago
- ☆37Updated 9 years ago