shibatch / sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
☆635 · Updated this week
Related projects:
- Agenium Scale vectorization library for CPUs and GPUs ☆324 · Updated 2 years ago
- std::simd for GCC [ISO/IEC TS 19570:2018] ☆571 · Updated last year
- Portable header-only C++ low level SIMD library ☆1,221 · Updated 3 weeks ago
- SIMD Vector Classes for C++ ☆1,447 · Updated 3 months ago
- Vector class library, latest version ☆1,281 · Updated 7 months ago
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE) ☆2,150 · Updated this week
- Official git repository for libdivide: optimized integer division ☆1,083 · Updated last week
- oneAPI DPC++ Library (oneDPL) https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-library.html ☆720 · Updated this week
- Expressive Vector Engine - SIMD in C++ Goes Brrrr ☆933 · Updated this week
- Library for specialized dense and sparse matrix operations, and deep learning primitives. ☆839 · Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆513 · Updated 3 months ago
- Open Source Parallel STL implementation ☆514 · Updated 7 months ago
- Conversion to/from half-precision floating point formats ☆324 · Updated last month
- C++ template library for high performance SIMD based sorting algorithms ☆844 · Updated last week
- Implementations of SIMD instruction sets for systems which don't natively support them. ☆2,334 · Updated this week
- A lightweight high performance tensor algebra framework for modern C++ ☆737 · Updated 5 months ago
- 🚀 Fast C/C++ bit population count library ☆320 · Updated 2 months ago
- Enoki: structured vectorization and differentiation on modern processor architectures ☆1,244 · Updated 5 months ago
- Thin, unified, C++-flavored wrappers for the CUDA APIs ☆768 · Updated last week
- Reference implementation of Dragonbox in C++ ☆596 · Updated 2 weeks ago
- What features does your CPU and OS support? ☆272 · Updated 3 weeks ago
- LLVM Optimization to extract a function, embedded in its intermediate representation in the binary, and execute it using the LLVM Just-In… ☆510 · Updated 3 years ago
- VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP ☆701 · Updated 2 years ago
- A tool to graphically visualize SIMD code ☆658 · Updated last year
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, R… ☆1,413 · Updated this week
- Compressed numerical arrays that support high-speed random access ☆765 · Updated 2 weeks ago
- Intel® Implicit SPMD Program Compiler ☆2,473 · Updated this week
- An implementation of BLAS using the SYCL open standard. ☆250 · Updated 2 weeks ago
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade… ☆555 · Updated last week
- pocl - Portable Computing Language ☆911 · Updated this week