DLTcollab / sse2neonLinks
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
☆1,396Updated 2 weeks ago
Alternatives and similar repositories for sse2neon
Users that are interested in sse2neon are comparing it to the libraries listed below
Sorting:
- Implementations of SIMD instruction sets for systems which don't natively support them.☆2,715Updated last month
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))☆2,413Updated this week
- The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S…☆464Updated last month
- Portable header-only C++ low level SIMD library☆1,277Updated 10 months ago
- C++ template library for high performance SIMD based sorting algorithms☆949Updated last week
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT☆732Updated 2 months ago
- Optimized implementations of various library functions for ARM architecture processors☆629Updated this week
- C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.☆2,164Updated last week
- Makes ARM NEON documentation accessible (with examples)☆398Updated last year
- Intel® Implicit SPMD Program Compiler☆2,689Updated this week
- Vector class library, latest version☆1,375Updated last year
- SIMD Vector Classes for C++☆1,495Updated last year
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, R…☆1,800Updated 3 weeks ago
- std::simd for GCC [ISO/IEC TS 19570:2018]☆620Updated 2 years ago
- Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C☆2,307Updated last month
- C++14 lock-free queue.☆1,671Updated last month
- Message passing based allocator☆1,694Updated last month
- Agenium Scale vectorization library for CPUs and GPUs☆333Updated 3 years ago
- oneAPI Threading Building Blocks (oneTBB)☆6,190Updated last week
- Performance-portable, length-agnostic SIMD with runtime dispatch☆4,706Updated last week
- A C library that may be linked into a C/C++ program to produce symbolic backtraces☆1,076Updated 2 months ago
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload.☆122Updated last year
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, …☆1,655Updated last week
- Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extens…☆1,407Updated this week
- A family of header-only, very fast and memory-friendly hashmap and btree containers.☆2,988Updated 3 weeks ago
- Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual"☆797Updated last year
- Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific).☆505Updated this week
- Official git repository for libdivide: optimized integer division☆1,215Updated last week
- The Hoard Memory Allocator: A Fast, Scalable, and Memory-efficient Malloc for Linux, Windows, and Mac.☆1,170Updated 2 months ago
- Fast & memory efficient hashtable based on robin hood hashing for C++11/14/17/20☆1,571Updated 2 years ago