thenifty / neon-guide
Makes ARM NEON documentation accessible (with examples)
☆382Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for neon-guide
- The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S…☆430Updated 2 months ago
- Conversion to/from half-precision floating point formats☆333Updated 3 months ago
- Portable (POSIX/Windows/Emscripten) thread pool for C/C++☆353Updated 5 months ago
- Automatically exported from code.google.com/p/sse2neon☆285Updated 4 years ago
- Just my local copy of math-neon with build script☆91Updated 6 years ago
- ☆303Updated last week
- A stub opecl library that dynamically dlopen/dlsyms opencl implementations at runtime based on environment variables. Will be useful when…☆67Updated 8 months ago
- Automatically exported from code.google.com/p/opencl-book-samples☆162Updated 5 years ago
- A tool which profiles OpenCL devices to find their peak capacities☆411Updated 2 weeks ago
- An open optimized software library project for the ARM® Architecture☆1,462Updated last year
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆314Updated 2 weeks ago
- Khronos OpenCL-CLHPP☆379Updated 3 weeks ago
- Optimized implementations of various library functions for ARM architecture processors☆601Updated this week
- Automatically exported from code.google.com/p/math-neon☆38Updated 9 years ago
- arm neon 相关文档和指令意义☆238Updated 5 years ago
- An OpenCL device simulator and debugger☆346Updated 2 months ago
- A profiler to disclose and quantify hardware features on GPUs.☆162Updated 2 years ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆124Updated last year
- Agenium Scale vectorization library for CPUs and GPUs☆328Updated 3 years ago
- Arm neon optimization practice☆388Updated 3 years ago
- The OpenCL Conformance Tests☆184Updated this week
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT☆667Updated last week
- A CPU tool for benchmarking the peak of floating points☆503Updated last month
- ☆118Updated 11 years ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib☆54Updated last year
- VeriSilicon Tensor Interface Module☆224Updated 3 months ago
- Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific).☆479Updated last week
- mperf是一个面向移动/嵌入式平台的算子性能调优工具箱☆171Updated last year
- Portable header-only C++ low level SIMD library☆1,242Updated 2 months ago
- Demonstration of various hardware effects on CUDA GPUs.☆358Updated 11 months ago