Makes ARM NEON documentation accessible (with examples)
☆410Apr 13, 2024Updated 2 years ago
Alternatives and similar repositories for neon-guide
Users that are interested in neon-guide are comparing it to the libraries listed below.
- Documentation on ARM NEON and the meaning of its instructions☆248May 21, 2019Updated 6 years ago
- Simple test code to benchmark the VFP floating point or NEON units of ARM processors☆26Jul 11, 2013Updated 12 years ago
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…☆3,146Updated this week
- Arm neon optimization practice☆393Dec 22, 2020Updated 5 years ago
- An open optimized software library project for the ARM® Architecture☆1,536Dec 9, 2022Updated 3 years ago
- The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S…☆494Apr 28, 2026Updated 3 weeks ago
- arm-neon☆93Aug 2, 2024Updated last year
- Handy tools & graphics API abstraction for blazing fast prototyping☆10Jan 17, 2024Updated 2 years ago
- row-major matmul optimization☆725Updated this week
- C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON, SVE for ARM, HVX for Hex…☆2,250May 5, 2026Updated 2 weeks ago
- Low-precision matrix multiplication☆1,843Jan 29, 2024Updated 2 years ago
- ☆2,012Jul 29, 2023Updated 2 years ago
- Automatically exported from code.google.com/p/sse2neon☆289Jul 21, 2020Updated 5 years ago
- Heterogeneous Run Time version of Caffe. Added heterogeneous capabilities to the Caffe, uses heterogeneous computing infrastructure frame…☆269Oct 16, 2018Updated 7 years ago
- Implementation of various math, img processing, etc functions for ARMv7 and NEON☆18Dec 26, 2010Updated 15 years ago
- Benchmark for embedded-AI deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆202Feb 18, 2021Updated 5 years ago
- Just my local copy of math-neon with build script☆95Aug 10, 2018Updated 7 years ago
- mperf is an operator performance tuning toolbox for mobile/embedded platforms☆194Aug 17, 2023Updated 2 years ago
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE, WebAssembly, VSX, RISC-…☆2,692Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,341Updated this week
- ☆21Apr 13, 2022Updated 4 years ago
- Guide examples for optimizing mobile CV algorithms.☆29Oct 24, 2022Updated 3 years ago
- Acceleration package for neural networks on multi-core CPUs☆1,706Jun 11, 2024Updated last year
- FeatherCNN is a high performance inference engine for convolutional neural networks.☆1,228Sep 24, 2019Updated 6 years ago
- demo code of my blog☆56Dec 20, 2023Updated 2 years ago
- A CPU tool for benchmarking the peak of floating points☆582May 4, 2026Updated 2 weeks ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆628Feb 9, 2026Updated 3 months ago
- BLISlab: A Sandbox for Optimizing GEMM☆562Jun 17, 2021Updated 4 years ago
- Introduction about SIMD instructions. Mainly about SSE and AVX.☆13Mar 13, 2018Updated 8 years ago
- Arm NN ML Software.☆1,303Jan 23, 2026Updated 3 months ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,549Aug 28, 2019Updated 6 years ago
- Better CMake Experience☆36Dec 9, 2025Updated 5 months ago
- Tengine is a lite, high performance, modular inference engine for embedded device☆4,523Mar 6, 2025Updated last year
- MTCNN Face Detection & Alignment☆203Sep 8, 2017Updated 8 years ago
- Lightweight Mat and imread()/imwrite()/imshow()☆83Jun 17, 2024Updated last year
- Heterogeneous Run Time version of MXNet. Added heterogeneous capabilities to the MXNet, uses heterogeneous computing infrastructure frame…☆72Feb 11, 2018Updated 8 years ago
- A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation☆1,507Apr 12, 2026Updated last month
- ☆42Jun 25, 2020Updated 5 years ago
- Efficient implementation of masked AES on ARM NEON☆23Jun 6, 2017Updated 8 years ago