shibatch / sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
☆697Updated this week
Alternatives and similar repositories for sleef:
Users that are interested in sleef are comparing it to the libraries listed below
- Agenium Scale vectorization library for CPUs and GPUs☆330Updated 3 years ago
- Vector class library, latest version☆1,337Updated last year
- std::simd for GCC [ISO/IEC TS 19570:2018]☆602Updated last year
- Portable header-only C++ low level SIMD library☆1,260Updated 5 months ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆862Updated this week
- Expressive Vector Engine - SIMD in C++ Goes Brrrr☆1,142Updated this week
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))☆2,308Updated this week
- SIMD Vector Classes for C++☆1,474Updated 8 months ago
- C++ template library for high performance SIMD based sorting algorithms☆914Updated this week
- A lightweight high performance tensor algebra framework for modern C++☆771Updated 10 months ago
- Reference implementation of Dragonbox in C++☆630Updated 3 months ago
- Implementations of SIMD instruction sets for systems which don't natively support them.☆2,531Updated last week
- Conversion to/from half-precision floating point formats☆341Updated 6 months ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆521Updated this week
- Official git repository for libdivide: optimized integer division☆1,148Updated 3 weeks ago
- LLVM Optimization to extract a function, embedded in its intermediate representation in the binary, and execute it using the LLVM Just-In…☆518Updated 3 years ago
- oneAPI DPC++ Library (oneDPL) https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-library.html☆735Updated this week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆263Updated last month
- fast log and exp functions for AVX2/AVX-512☆225Updated last month
- Demonstration of various hardware effects on CUDA GPUs.☆364Updated last year
- Intel® Implicit SPMD Program Compiler☆2,590Updated this week
- 🚀 Fast C/C++ bit population count library☆337Updated 7 months ago
- Storage for my snippets, toy programs, etc.☆349Updated 2 weeks ago
- Reference implementation of mdspan targeting C++23☆435Updated this week
- Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C+…☆1,510Updated this week
- SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html☆335Updated 10 months ago
- Enoki: structured vectorization and differentiation on modern processor architectures☆1,273Updated 2 weeks ago
- A simple, extensible, portable, efficient and header-only SIMD library!☆230Updated 3 years ago
- Portable (POSIX/Windows/Emscripten) thread pool for C/C++☆365Updated 8 months ago
- Open Source Parallel STL implementation☆522Updated last year