DLTcollab / sse2neon
A translator from Intel SSE intrinsics to Arm/AArch64 NEON implementation
☆1,286 Updated last month
Related projects:
- Implementations of SIMD instruction sets for systems which don't natively support them. ☆2,338 Updated this week
- A platform-independent header that allows compiling any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S… ☆430 Updated last week
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE) ☆2,150 Updated this week
- C++ template library for high performance SIMD based sorting algorithms ☆844 Updated 2 weeks ago
- Makes ARM NEON documentation accessible (with examples) ☆380 Updated 5 months ago
- Vector class library, latest version ☆1,282 Updated 7 months ago
- Optimized implementations of various library functions for ARM architecture processors ☆586 Updated this week
- C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, VMX (Altivec) and VSX (Power7) f… ☆2,036 Updated last week
- Intel® Implicit SPMD Program Compiler ☆2,473 Updated last week
- Portable header-only C++ low level SIMD library ☆1,221 Updated 3 weeks ago
- Official git repository for libdivide: optimized integer division ☆1,083 Updated last week
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT ☆638 Updated this week
- Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual" ☆775 Updated 4 months ago
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, R… ☆1,413 Updated this week
- A cross platform C99 library to get cpu features at runtime. ☆2,437 Updated last week
- Automatically exported from code.google.com/p/sse2neon ☆285 Updated 4 years ago
- C/C++ Performance Profiler ☆4,166 Updated last week
- Speed-up of over 50% on average vs. traditional memcpy in gcc 4.9 or vc2012 ☆583 Updated 5 months ago
- SIMD Vector Classes for C++ ☆1,447 Updated 3 months ago
- C++ lockless queue. ☆1,478 Updated last month
- Performance-portable, length-agnostic SIMD with runtime dispatch ☆4,099 Updated this week
- An open optimized software library project for the ARM® Architecture ☆1,459 Updated last year
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl ☆2,293 Updated 7 months ago
- Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C ☆2,109 Updated 2 months ago
- std::simd for GCC [ISO/IEC TS 19570:2018] ☆571 Updated last year
- Fast & memory efficient hashtable based on robin hood hashing for C++11/14/17/20 ☆1,511 Updated last year
- A JIT assembler for x86 (IA-32)/x64 (AMD64, x86-64) MMX/SSE/SSE2/SSE3/SSSE3/SSE4/FPU/AVX/AVX2/AVX-512, provided as a C++ header ☆2,025 Updated 3 weeks ago
- Apple AMX Instruction Set ☆976 Updated 3 months ago
- A family of header-only, very fast and memory-friendly hashmap and btree containers. ☆2,471 Updated 3 weeks ago
- A beautiful stack trace pretty printer for C++ ☆3,735 Updated 2 months ago