Tutorial to optimize GEMM performance on Android
☆51 Feb 17, 2016 Updated 10 years ago
Alternatives and similar repositories for gemm-android
Users who are interested in gemm-android are comparing it to the libraries listed below.
- profiling gemm on android ☆10 Apr 1, 2016 Updated 9 years ago
- ICME 2016 "Learning Deep Representation from Coarse to Fine for Face Alignment" ☆30 Oct 29, 2018 Updated 7 years ago
- Low-precision matrix multiplication ☆1,832 Jan 29, 2024 Updated 2 years ago
- ☆10 Sep 10, 2025 Updated 6 months ago
- SqueezeNet: AlexNet-level accuracy with 50x fewer parameters ☆21 Mar 2, 2016 Updated 10 years ago
- GPU Automatically Tuned Linear Algebra Software ☆28 Sep 1, 2015 Updated 10 years ago
- The benchmark of ncnn that is a high-performance neural network inference framework optimized for the mobile platform ☆72 Mar 8, 2019 Updated 7 years ago
- CK-NNTest: collaboratively validating, benchmarking and optimizing neural net operators across platforms, frameworks and datasets ☆15 Jul 10, 2021 Updated 4 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM ☆73 Nov 21, 2016 Updated 9 years ago
- Porting caffe to android platform ☆10 Jul 16, 2016 Updated 9 years ago
- Open single and half precision gemm implementations ☆397 Apr 2, 2023 Updated 2 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆137 Apr 20, 2017 Updated 8 years ago
- Tuned OpenCL BLAS ☆1,169 Feb 1, 2026 Updated last month
- ☆20 Dec 15, 2023 Updated 2 years ago
- Face detection with alignment from unconstrained photos ☆12 Sep 29, 2015 Updated 10 years ago
- Train neural networks to automate your home ☆19 Mar 1, 2023 Updated 3 years ago
- A simple baseline model set using MXNet for Kaggle StateFarm driver position identification ☆27 Jul 1, 2016 Updated 9 years ago
- A light-weight deep convolutional neural network for face detection ☆13 Mar 8, 2019 Updated 7 years ago
- ☆16 Nov 21, 2017 Updated 8 years ago
- Open Source Library for GPU-Accelerated Execution of Trained Deep Convolutional Neural Networks on Android ☆542 Apr 12, 2017 Updated 8 years ago
- Acceleration package for neural networks on multi-core CPUs ☆1,702 Jun 11, 2024 Updated last year
- DLPack for Tensorflow ☆35 Apr 13, 2020 Updated 5 years ago
- Deep CNN on Android ☆30 Feb 26, 2017 Updated 9 years ago
- Amalgamation and go binding ☆63 Nov 11, 2015 Updated 10 years ago
- Proof-of-Concept CNN in Halide ☆22 Aug 4, 2016 Updated 9 years ago
- Portable 128-bit SIMD intrinsics ☆59 Jul 4, 2023 Updated 2 years ago
- Companion source code for GTC 2014 talk ☆11 Mar 25, 2014 Updated 11 years ago
- NNVM for ROCm Examples ☆19 Nov 22, 2017 Updated 8 years ago
- Personal collection of references for high performance mixed precision training. ☆41 Oct 21, 2019 Updated 6 years ago
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi… ☆3,122 Updated this week
- An Example of MXNet Models Compilation and Deployment with NNVM in C++ ☆16 Apr 25, 2018 Updated 7 years ago
- Neural Style Transfer with Caffe2 on your Android phone ☆82 Mar 28, 2019 Updated 6 years ago
- Cross platform (Visual Studio,Xcode,clang,gcc...) testsuite for OpenCL. Based on CMake and LLVM's lit test framework. ☆18 Dec 10, 2017 Updated 8 years ago
- Wrap the FFmpeg API in C++ and register it in QML. You can use the QML type VideoItem as you would with QtMultimedia. ☆14 Nov 16, 2015 Updated 10 years ago
- Fresh Python Mesos Scheduler and Executor driver ☆18 Oct 19, 2017 Updated 8 years ago
- ☆10 May 4, 2023 Updated 2 years ago
- Automatically exported from code.google.com/p/math-neon ☆40 Apr 20, 2015 Updated 10 years ago
- CUDA Extension Wrangler ☆26 Aug 21, 2019 Updated 6 years ago
- ☆17 Aug 22, 2021 Updated 4 years ago