DLTcollab / sse2neon
A translator from Intel SSE intrinsics to Arm/AArch64 NEON implementation
☆1,418 · Updated 2 weeks ago
Alternatives and similar repositories for sse2neon
Users who are interested in sse2neon are comparing it to the libraries listed below.
- Implementations of SIMD instruction sets for systems which don't natively support them. ☆2,764 · Updated last week
- A platform-independent header that allows compiling any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S… ☆470 · Updated 3 months ago
- Makes ARM NEON documentation accessible (with examples) ☆403 · Updated last year
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT ☆750 · Updated last month
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE) ☆2,465 · Updated last week
- Vector class library, latest version ☆1,387 · Updated last year
- C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM. ☆2,189 · Updated last week
- C++ template library for high performance SIMD based sorting algorithms ☆958 · Updated 2 months ago
- Optimized implementations of various library functions for ARM architecture processors ☆648 · Updated 2 weeks ago
- A cross-platform C99 library to get CPU features at runtime. ☆2,544 · Updated 2 weeks ago
- Intel® Implicit SPMD Program Compiler ☆2,725 · Updated last week
- CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS) ☆1,112 · Updated 2 weeks ago
- Performance-portable, length-agnostic SIMD with runtime dispatch ☆4,855 · Updated this week
- Portable header-only C++ low level SIMD library ☆1,285 · Updated 11 months ago
- Encapsulates frequently used AVX instructions as independent modules to reduce repeated development work. ☆124 · Updated last year
- Public domain, cross-platform, lock-free, thread-caching, 16-byte aligned memory allocator implemented in C ☆2,333 · Updated 2 weeks ago
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, R… ☆1,851 · Updated last week
- Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual" ☆799 · Updated last year
- Agenium Scale vectorization library for CPUs and GPUs ☆333 · Updated 3 years ago
- Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific). ☆507 · Updated last month
- pocl - Portable Computing Language ☆1,014 · Updated last week
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade… ☆595 · Updated 11 months ago
- std::simd for GCC [ISO/IEC TS 19570:2018] ☆626 · Updated 2 years ago
- Heavily optimized library for DEFLATE/zlib/gzip compression and decompression ☆1,141 · Updated last month
- A C library that may be linked into a C/C++ program to produce symbolic backtraces ☆1,098 · Updated 4 months ago
- Apple AMX Instruction Set ☆1,128 · Updated 7 months ago
- SIMD Vector Classes for C++ ☆1,507 · Updated last year
- Official git repository for libdivide: optimized integer division ☆1,232 · Updated 2 months ago
- Khronos OpenCL-Headers ☆729 · Updated last month
- A JIT assembler for x86/x64 architectures supporting MMX, SSE (1-4), AVX (1-2, 512), FPU, APX, and AVX10.2 ☆2,159 · Updated this week