jhjourdan / SIMD-math-prims
Vectorizable implementations of some mathematical functions
☆103Updated 4 years ago
Alternatives and similar repositories for SIMD-math-prims:
Users that are interested in SIMD-math-prims are comparing it to the libraries listed below
- fast log and exp functions for AVX2/AVX-512☆227Updated last week
- Portable C++ SIMD library☆174Updated 5 years ago
- Flexible Library for Efficient Numerical Solutions☆127Updated 3 years ago
- UME::SIMD A library for explicit simd vectorization.☆91Updated 7 years ago
- Vectorization EDSL library☆15Updated 5 years ago
- Generic SIMD intrinsic to allow for portable SIMD intrinsic programming☆42Updated 11 years ago
- AVX-optimized sin(), cos(), exp() and log() functions☆121Updated 3 years ago
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g…☆74Updated 4 years ago
- Approximate and vectorized versions of common mathematical functions☆201Updated 5 years ago
- SPMD in C++☆68Updated 4 years ago
- Fast random number generators: Vectorized (SIMD) version of xorshift128+☆113Updated 4 years ago
- Full-speed Array of Structures access☆164Updated last year
- SSE2 Optimized GLSL-like math library☆116Updated 10 years ago
- Optimized CppSPMD test project: macro control flow, SSE4.1/AVX1/AVX2/AVX2 FMA support☆117Updated 4 years ago
- C++ micro benchmark library☆92Updated 8 years ago
- Execution primitives for C++☆154Updated 4 years ago
- C++ multidimensional arrays in the spirit of the STL☆200Updated 2 months ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆70Updated 9 years ago
- Mirror of the Cephes C source for reference☆91Updated last year
- Pruning elements in SIMD vectors (i.e., packing left elements)☆65Updated last year
- Fixed Point Arithmetic C++14 Library☆110Updated 6 years ago
- Add-on packages for Vector class library☆73Updated last year
- Programming Accelerators with C++ (PACXX)☆58Updated 7 years ago
- stl compatible hashtable☆29Updated 7 years ago
- Fastest Histogram Construction☆70Updated 3 years ago
- Vectorized version of the PCG random number generator☆76Updated 2 weeks ago
- Blazing-fast Expression Templates Library (ETL) with GPU support, in C++☆222Updated last year
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank☆64Updated 9 years ago
- Boost SIMD☆232Updated 5 years ago
- A fast and highly scalable GPU dynamic memory allocator☆104Updated 10 years ago