shibatch / sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
☆667 · Updated last week
Related projects
Alternatives and complementary repositories for sleef
- Agenium Scale vectorization library for CPUs and GPUs ☆328 · Updated 3 years ago
- Vector class library, latest version ☆1,308 · Updated 9 months ago
- std::simd for GCC [ISO/IEC TS 19570:2018] ☆579 · Updated last year
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE) ☆2,211 · Updated last week
- Portable header-only C++ low level SIMD library ☆1,242 · Updated 2 months ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives. ☆850 · Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆518 · Updated 5 months ago
- oneAPI DPC++ Library (oneDPL) https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-library.html ☆724 · Updated this week
- SIMD Vector Classes for C++ ☆1,458 · Updated 5 months ago
- Expressive Vector Engine - SIMD in C++ Goes Brrrr ☆964 · Updated this week
- Open Source Parallel STL implementation ☆517 · Updated 9 months ago
- Implementations of SIMD instruction sets for systems which don't natively support them. ☆2,406 · Updated this week
- Conversion to/from half-precision floating point formats ☆333 · Updated 3 months ago
- Official git repository for libdivide: optimized integer division ☆1,106 · Updated 2 weeks ago
- C++ template library for high performance SIMD based sorting algorithms ☆887 · Updated last week
- A tool to graphically visualize SIMD code ☆664 · Updated last year
- A lightweight high performance tensor algebra framework for modern C++ ☆751 · Updated 7 months ago
- An implementation of BLAS using the SYCL open standard. ☆259 · Updated 2 weeks ago
- Intel® Implicit SPMD Program Compiler ☆2,520 · Updated this week
- The Hoard Memory Allocator: A Fast, Scalable, and Memory-efficient Malloc for Linux, Windows, and Mac. ☆1,106 · Updated 3 months ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆439 · Updated 2 weeks ago
- LLVM Optimization to extract a function, embedded in its intermediate representation in the binary, and execute it using the LLVM Just-In… ☆513 · Updated 3 years ago
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade… ☆560 · Updated 2 months ago
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms. ☆396 · Updated this week
- Compressed numerical arrays that support high-speed random access ☆772 · Updated this week
- 🚀 Fast C/C++ bit population count library ☆330 · Updated 4 months ago
- Omnitrace: Application Profiling, Tracing, and Analysis ☆299 · Updated this week
- AVX-optimized sin(), cos(), exp() and log() functions ☆113 · Updated 2 years ago
- Storage for my snippets, toy programs, etc. ☆320 · Updated 6 months ago
- stdgpu: Efficient STL-like Data Structures on the GPU ☆1,162 · Updated this week