JishinMaster / simd_utilsLinks
A header only library implementing common mathematical functions using SIMD intrinsics
☆109Updated 3 weeks ago
Alternatives and similar repositories for simd_utils
Users that are interested in simd_utils are comparing it to the libraries listed below
Sorting:
- AVX-optimized sin(), cos(), exp() and log() functions☆125Updated 3 years ago
- A fast implementation of log() and exp()☆53Updated 2 years ago
- ☆31Updated 3 years ago
- Agenium Scale vectorization library for CPUs and GPUs☆333Updated 3 years ago
- Add-on packages for Vector class library☆75Updated last year
- Cross platform portable accelerate math library using universal intrinsics.☆81Updated 4 years ago
- Header-only C++ library for low precision floating point type emulation.☆175Updated 5 years ago
- SIMD optimised library for matrix inversion of 2x2, 3x3, and 4x4 matrices.☆93Updated 9 years ago
- CPP20 implementation of a 16-bit floating-point type mimicking most of the IEEE 754 behavior. Single file and header-only.☆41Updated last year
- ☆148Updated last year
- A simple and fast minimalistic header-only library allowing to run async tasks and execute task graphs.☆53Updated 8 months ago
- llvm::SmallVector stripped from dependencies.☆34Updated 5 years ago
- Struct-of-Arrays generator for C++ projects.☆59Updated last year
- Fast random number generators: Vectorized (SIMD) version of xorshift128+☆117Updated 5 years ago
- Task graph-based asynchronous programming system using C++ coroutine☆92Updated last year
- A High-Throughput Parallel Lossless Compressor for Scientific Data☆70Updated 2 years ago
- UME::SIMD A library for explicit simd vectorization.☆91Updated 7 years ago
- C++ template library for floating point operations☆30Updated 3 weeks ago
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!☆103Updated last week
- A curated list of awesome SIMD frameworks, libraries and software☆199Updated 10 months ago
- A High-Performance JIT-Based C++ Expression/Script Execution Engine with SIMD Vectorization Support☆91Updated last month
- Conversion to/from half-precision floating point formats☆362Updated last year
- performance experiments for C++ exception handling☆30Updated 3 years ago
- Extremely fast x86 / AVX2 assembly implementations of common operations for linear arrays: checking whether array contains element, findi…☆38Updated 3 years ago
- C++20 Tensor library☆27Updated last week
- C++ implementation of a 16 bit floating-point type mimicking most of the IEEE 754 behaviour. Compatible with the half data type used as t…☆148Updated 13 years ago
- Bistra is a domain-specific language designed to generate high-performance kernels (such as GEMMs, convolutions, etc). The program is des…☆9Updated last year
- Teaching Vectorization and SIMD using Intel Intrinsics in a Computer Organization and Architecture class☆15Updated 5 months ago
- Looking into the performance of heaps, starting with the Min-Max Heap☆68Updated 4 years ago
- Mirror of the Cephes C source for reference☆94Updated last year