tutorial to optimize GEMM performance on android
☆51Feb 17, 2016Updated 10 years ago
Alternatives and similar repositories for gemm-android
Users that are interested in gemm-android are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ICME 2016 "Learning Deep Representation from Coarse to Fine for Face Alignment"☆30Oct 29, 2018Updated 7 years ago
- Low-precision matrix multiplication☆1,843Jan 29, 2024Updated 2 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Nov 21, 2017Updated 8 years ago
- SqueezeNet: AlexNet-level accuracy with 50x fewer parameters☆21Mar 2, 2016Updated 10 years ago
- A faster re-implementation of the FAST-9 algorithm (C++, with C bindings available)☆14Feb 1, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- CK-NNTest: collaboratively validating, benchmarking and optimizing neural net operators across platforms, frameworks and datasets☆15Jul 10, 2021Updated 4 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆72Nov 21, 2016Updated 9 years ago
- Porting caffe to android platform☆10Jul 16, 2016Updated 9 years ago
- Open single and half precision gemm implementations☆397Apr 2, 2023Updated 3 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Apr 20, 2017Updated 9 years ago
- Tuned OpenCL BLAS☆1,174Apr 13, 2026Updated last month
- ☆20Dec 15, 2023Updated 2 years ago
- Face detection with alignment from unconstrained photos☆12Sep 29, 2015Updated 10 years ago
- A program that times various techniques for performing a moving median filter (sometimes called rolling median, or streaming median)☆11Feb 13, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Train Neuronal networks to automate your home☆19Mar 1, 2023Updated 3 years ago
- A simple baseline model set using MXNet for Kaggle StateFarm driver position identification☆27Jul 1, 2016Updated 9 years ago
- This is an read-only mirror of the gem5 simulator. The upstream repository is stored in https://gem5.googlesource.com, code reviews shoul…☆19Aug 21, 2021Updated 4 years ago
- A light-weight deep convolutional neural network for face detection☆13Mar 8, 2019Updated 7 years ago
- ☆16Nov 21, 2017Updated 8 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆185Dec 12, 2022Updated 3 years ago
- Open Source Library for GPU-Accelerated Execution of Trained Deep Convolutional Neural Networks on Android☆544Apr 12, 2017Updated 9 years ago
- Acceleration package for neural networks on multi-core CPUs☆1,706Jun 11, 2024Updated last year
- DLPack for Tensorflow☆35Apr 13, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Deep CNN on Android☆30Feb 26, 2017Updated 9 years ago
- Amalgamation and go binding☆63Nov 11, 2015Updated 10 years ago
- Proof-of-Concept CNN in Halide☆22Aug 4, 2016Updated 9 years ago
- Companion source code for GTC 2014 talk☆11Mar 25, 2014Updated 12 years ago
- NNVM for ROCm Examples☆19Nov 22, 2017Updated 8 years ago
- Portable 128-bit SIMD intrinsics☆60Jul 4, 2023Updated 2 years ago
- An Example of MXNet Models Comilation and Deployment with NNVM in C++☆16Apr 25, 2018Updated 8 years ago
- Neural Style Transfer with Caffe2 on your Android phone☆82Mar 28, 2019Updated 7 years ago
- Cross platform (Visual Studio,Xcode,clang,gcc...) testsuite for OpenCL. Based on CMake and LLVM's lit test framework.☆18Dec 10, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A powerful Laravel storage driver that enables seamless synchronization of files across multiple disks, with an integrated cache disk for…☆15Nov 11, 2025Updated 6 months ago
- C++ training and testing code for an SVM using Vlfeat fisher vectors together with possible other features.☆14Jun 29, 2016Updated 9 years ago
- ☆10May 4, 2023Updated 3 years ago
- ☆404Mar 15, 2019Updated 7 years ago
- Automatically exported from code.google.com/p/math-neon☆40Apr 20, 2015Updated 11 years ago
- BLISlab: A Sandbox for Optimizing GEMM☆562Jun 17, 2021Updated 4 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆628Feb 9, 2026Updated 3 months ago