DLTcollab / sse2neonLinks
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
☆1,430Updated last month
Alternatives and similar repositories for sse2neon
Users that are interested in sse2neon are comparing it to the libraries listed below
Sorting:
- Implementations of SIMD instruction sets for systems which don't natively support them.☆2,806Updated last week
- The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S…☆477Updated 2 weeks ago
- Makes ARM NEON documentation accessible (with examples)☆404Updated last year
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))☆2,501Updated 2 weeks ago
- Optimized implementations of various library functions for ARM architecture processors☆661Updated 2 weeks ago
- C++ template library for high performance SIMD based sorting algorithms☆978Updated 3 weeks ago
- Vector class library, latest version☆1,396Updated last year
- C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.☆2,208Updated this week
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT☆764Updated 2 weeks ago
- Intel® Implicit SPMD Program Compiler☆2,754Updated last week
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload.☆125Updated last year
- Portable header-only C++ low level SIMD library☆1,292Updated last year
- Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual"☆803Updated last year
- Automatically exported from code.google.com/p/sse2neon☆290Updated 5 years ago
- SIMD Vector Classes for C++☆1,510Updated last year
- Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific).☆510Updated 3 months ago
- Portable (POSIX/Windows/Emscripten) thread pool for C/C++☆382Updated last year
- pocl - Portable Computing Language☆1,025Updated this week
- Performance-portable, length-agnostic SIMD with runtime dispatch☆5,046Updated last week
- Agenium Scale vectorization library for CPUs and GPUs☆333Updated 3 years ago
- CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)☆1,125Updated last month
- Official git repository for libdivide: optimized integer division☆1,246Updated 3 months ago
- The Hoard Memory Allocator: A Fast, Scalable, and Memory-efficient Malloc for Linux, Windows, and Mac.☆1,171Updated 3 weeks ago
- A JIT assembler for x86/x64 architectures supporting FPU, MMX, SSE (1-4), AVX (1-2, 512), APX, and AVX10.2☆2,187Updated last month
- Apple AMX Instruction Set☆1,152Updated 9 months ago
- std::simd for GCC [ISO/IEC TS 19570:2018]☆629Updated 2 years ago
- Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C☆2,344Updated last month
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, …☆1,712Updated this week
- Khronos OpenCL-CLHPP☆405Updated last month
- Fast Base64 stream encoder/decoder in C99, with SIMD acceleration☆962Updated 5 months ago