tutorial to optimize GEMM performance on android
☆51Feb 17, 2016Updated 10 years ago
Alternatives and similar repositories for gemm-android
Users that are interested in gemm-android are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- profiling gemm on android☆10Apr 1, 2016Updated 10 years ago
- ICME 2016 "Learning Deep Representation from Coarse to Fine for Face Alignment"☆30Oct 29, 2018Updated 7 years ago
- Low-precision matrix multiplication☆1,838Jan 29, 2024Updated 2 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Nov 21, 2017Updated 8 years ago
- SqueezeNet: AlexNet-level accuracy with 50x fewer parameters☆21Mar 2, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- GPU Automatically Tuned Linear Algebra Software☆28Sep 1, 2015Updated 10 years ago
- A faster re-implementation of the FAST-9 algorithm (C++, with C bindings available)☆14Feb 1, 2017Updated 9 years ago
- The benchmark of ncnn that is a high-performance neural network inference framework optimized for the mobile platform☆72Mar 8, 2019Updated 7 years ago
- CK-NNTest: collaboratively validating, benchmarking and optimizing neural net operators across platforms, frameworks and datasets☆15Jul 10, 2021Updated 4 years ago
- Open single and half precision gemm implementations☆397Apr 2, 2023Updated 3 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Apr 20, 2017Updated 8 years ago
- Tuned OpenCL BLAS☆1,171Apr 3, 2026Updated last week
- Face detection with alignment from unconstrained photos☆12Sep 29, 2015Updated 10 years ago
- A program that times various techniques for performing a moving median filter (sometimes called rolling median, or streaming median)☆11Feb 13, 2016Updated 10 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Train Neuronal networks to automate your home☆19Mar 1, 2023Updated 3 years ago
- A light-weight deep convolutional neural network for face detection☆13Mar 8, 2019Updated 7 years ago
- ☆16Nov 21, 2017Updated 8 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆185Dec 12, 2022Updated 3 years ago
- Open Source Library for GPU-Accelerated Execution of Trained Deep Convolutional Neural Networks on Android☆543Apr 12, 2017Updated 8 years ago
- Acceleration package for neural networks on multi-core CPUs☆1,704Jun 11, 2024Updated last year
- DLPack for Tensorflow☆35Apr 13, 2020Updated 5 years ago
- Deep CNN on Android☆30Feb 26, 2017Updated 9 years ago
- Amalgamation and go binding☆63Nov 11, 2015Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Proof-of-Concept CNN in Halide☆22Aug 4, 2016Updated 9 years ago
- Portable 128-bit SIMD intrinsics☆59Jul 4, 2023Updated 2 years ago
- a software library containing BLAS functions written in OpenCL☆864Aug 2, 2024Updated last year
- NNVM for ROCm Examples☆19Nov 22, 2017Updated 8 years ago
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…☆3,126Apr 2, 2026Updated last week
- An Example of MXNet Models Comilation and Deployment with NNVM in C++☆16Apr 25, 2018Updated 7 years ago
- Neural Style Transfer with Caffe2 on your Android phone☆82Mar 28, 2019Updated 7 years ago
- Cross platform (Visual Studio,Xcode,clang,gcc...) testsuite for OpenCL. Based on CMake and LLVM's lit test framework.☆18Dec 10, 2017Updated 8 years ago
- C++ training and testing code for an SVM using Vlfeat fisher vectors together with possible other features.☆14Jun 29, 2016Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10May 4, 2023Updated 2 years ago
- ☆404Mar 15, 2019Updated 7 years ago
- Automatically exported from code.google.com/p/math-neon☆40Apr 20, 2015Updated 10 years ago
- CUDA Extension Wrangler☆26Aug 21, 2019Updated 6 years ago
- BLISlab: A Sandbox for Optimizing GEMM☆561Jun 17, 2021Updated 4 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆627Feb 9, 2026Updated 2 months ago
- Simple test of ARM NEON code. Performs a blit to the framebuffer.☆15Jul 23, 2013Updated 12 years ago