genericsimd / generic_simd
Generic SIMD intrinsic to allow for portable SIMD intrinsic programming
☆42Updated 11 years ago
Alternatives and similar repositories for generic_simd:
Users that are interested in generic_simd are comparing it to the libraries listed below
- SSE2 Optimized GLSL-like math library☆116Updated 10 years ago
- Allocation benchmarks☆30Updated 8 years ago
- ☆13Updated 8 years ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆45Updated 5 years ago
- portability macros for compiler and hardware micro operations☆35Updated 7 months ago
- A hash table implementation using Robin Hood Linear Probing☆50Updated 10 years ago
- ☆31Updated 9 years ago
- Looking into the performance of heaps, starting with the Min-Max Heap☆65Updated 4 years ago
- Fastest Histogram Construction☆69Updated 3 years ago
- SPMD in C++☆68Updated 4 years ago
- Half precision floating point C++ library (imported from sourceforge upstream).☆34Updated 7 years ago
- Fastest CPU SIMD (SSE4) sorting networks for small integer arrays (2-6 elements), also optimal amd64 assembly and notes on getting compil…☆45Updated 3 years ago
- Fast open addressing hash table☆41Updated last year
- C++ library to pack and unpack vectors of integers having a small range of values using a technique called Frame of Reference☆51Updated last year
- Software implementation of any size ieee754 floating points☆53Updated 4 years ago
- benchmarking positional population count☆14Updated 11 months ago
- Polyfill some holes in the SSE intrinsics set☆50Updated 2 years ago
- SIMD optimizations related to 2D computer graphics☆34Updated 7 years ago
- LZSSE compression codec ported to SIMDe☆19Updated 4 years ago
- Pruning elements in SIMD vectors (i.e., packing left elements)☆64Updated last year
- Tiny portable C++ library for atomic operations☆54Updated last year
- High performance multithreading toolkit for C++17☆46Updated 3 months ago
- Fast, shared, upgradeable, non-recursive and non-fair mutex☆30Updated 6 years ago
- For details, see the blog post:☆32Updated last year
- fast prime sieve and hash algorithm☆38Updated last week
- Coroutines/Fibers implementation for x86☆65Updated 8 years ago
- String to Float Benchmark☆19Updated 6 years ago
- Experimental ranges for CUDA☆25Updated 6 years ago
- A scoped stack allocator☆36Updated 5 years ago
- C++ reflection framework (for fun)☆14Updated 10 years ago