genericsimd / generic_simd
Generic SIMD intrinsic to allow for portable SIMD intrinsic programming
☆41 · Updated 11 years ago
Alternatives and similar repositories for generic_simd
Users who are interested in generic_simd are comparing it to the libraries listed below.
- A hash table implementation using Robin Hood Linear Probing ☆51 · Updated 11 years ago
- LZ77/LZSS designed for SSE based decompression ☆142 · Updated 6 years ago
- Shader-Like Mathematical Expression JIT Engine for C++ Language ☆59 · Updated 6 years ago
- Fastest Histogram Construction ☆71 · Updated 3 years ago
- SSE2 Optimized GLSL-like math library ☆117 · Updated 11 years ago
- Intel(R) Concurrent Collections for C++ ☆116 · Updated 2 years ago
- Polyfill some holes in the SSE intrinsics set ☆50 · Updated 3 years ago
- portability macros for compiler and hardware micro operations ☆36 · Updated last year
- Vectorized version of the PCG random number generator ☆83 · Updated 7 months ago
- TLSF: two-level segregated fit O(1) allocator ☆80 · Updated 3 years ago
- Generates benchmark data for two different data structures, then renders some graphs. ☆112 · Updated 6 years ago
- A scoped stack allocator ☆37 · Updated 6 years ago
- LZSSE compression codec ported to SIMDe ☆19 · Updated 5 years ago
- High performance multithreading toolkit for C++17 ☆47 · Updated 2 months ago
- Fastest CPU SIMD (SSE4) sorting networks for small integer arrays (2-6 elements), also optimal amd64 assembly and notes on getting compil… ☆50 · Updated 4 years ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations ☆46 · Updated 5 years ago
- UME::SIMD A library for explicit simd vectorization. ☆91 · Updated 7 years ago
- Allocation benchmarks ☆30 · Updated 9 years ago
- Portable C++ SIMD library ☆171 · Updated 6 years ago
- ☆13 · Updated 9 years ago
- Looking into the performance of heaps, starting with the Min-Max Heap ☆69 · Updated 4 years ago
- Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon) ☆169 · Updated 7 months ago
- Pruning elements in SIMD vectors (i.e., packing left elements) ☆65 · Updated last year
- ☆57 · Updated 6 years ago
- Coroutines/Fibers implementation for x86 ☆65 · Updated 9 years ago
- Fast C/C++ memory allocator with free-all operation ☆55 · Updated 10 years ago
- Fast multi-threaded memory allocator ☆79 · Updated 5 years ago
- SIMD macro assembler unified for ARM, MIPS, PPC and x86 ☆90 · Updated 10 months ago
- Experimental ranges for CUDA ☆25 · Updated 6 years ago
- For details, see the blog post: ☆33 · Updated last year