DLTcollab / sse2neonLinks
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
☆1,470Updated last week
Alternatives and similar repositories for sse2neon
Users that are interested in sse2neon are comparing it to the libraries listed below
Sorting:
- Implementations of SIMD instruction sets for systems which don't natively support them.☆2,919Updated this week
- The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S…☆481Updated 2 months ago
- Makes ARM NEON documentation accessible (with examples)☆406Updated last year
- C++ template library for high performance SIMD based sorting algorithms☆992Updated 3 months ago
- Optimized implementations of various library functions for ARM architecture processors☆682Updated this week
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE, WebAssembly, VSX, RISC-…☆2,581Updated last week
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload.☆130Updated 2 years ago
- C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.☆2,228Updated this week
- Vector class library, latest version☆1,424Updated last year
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT☆797Updated 2 weeks ago
- Portable header-only C++ low level SIMD library☆1,297Updated last year
- CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)☆1,143Updated last week
- Intel® Implicit SPMD Program Compiler☆2,820Updated this week
- Agenium Scale vectorization library for CPUs and GPUs☆337Updated 4 years ago
- Portable (POSIX/Windows/Emscripten) thread pool for C/C++☆388Updated last year
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade…☆607Updated last year
- Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C☆2,394Updated 2 months ago
- SIMD Vector Classes for C++☆1,514Updated last year
- Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual"☆807Updated last year
- Official git repository for libdivide: optimized integer division☆1,277Updated 3 weeks ago
- C++14 lock-free queue.☆1,800Updated 3 weeks ago
- Hardware locality (hwloc)☆671Updated this week
- Conversion to/from half-precision floating point formats☆379Updated 4 months ago
- A C library that may be linked into a C/C++ program to produce symbolic backtraces☆1,149Updated 2 months ago
- Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific).☆517Updated last month
- A cross platform C99 library to get cpu features at runtime.☆2,569Updated last month
- Khronos OpenCL-Headers☆745Updated last week
- pocl - Portable Computing Language☆1,046Updated this week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,309Updated last year
- zlib replacement with optimizations for "next generation" systems.☆1,918Updated this week