oprecomp / FloatX
Header-only C++ library for low precision floating point type emulation.
☆171Updated 5 years ago
Alternatives and similar repositories for FloatX:
Users that are interested in FloatX are comparing it to the libraries listed below
- High-level C++ for Accelerator Clusters☆146Updated this week
- Execution primitives for C++☆153Updated 4 years ago
- Microbenchmarking for Modern C++☆219Updated 4 years ago
- A simple, extensible, portable, efficient and header-only SIMD library!☆229Updated 3 years ago
- Task graph-based asynchronous programming system using C++ coroutine☆89Updated last year
- C++ zero-cost abstraction for SoA/AoS memory layouts☆183Updated 2 years ago
- UME::SIMD A library for explicit simd vectorization.☆90Updated 7 years ago
- Agenium Scale vectorization library for CPUs and GPUs☆332Updated 3 years ago
- A C++14-and-later expression template library☆109Updated last week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆533Updated last month
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template…☆358Updated 8 months ago
- Clang with JIT extensions☆229Updated 2 years ago
- A micro microbenchmarking library for C++11 in a single header file☆217Updated 2 weeks ago
- The Berkeley Container Library☆124Updated last year
- Programming Accelerators with C++ (PACXX)☆58Updated 7 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 3 months ago
- CUDA kernel author's tools☆111Updated 2 years ago
- Eliminate all the tedious hassle when making state-of-the-art C++ 14 - 23 libraries!☆167Updated 2 weeks ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- Add-on packages for Vector class library☆73Updated last year
- ☆69Updated 4 years ago
- Multiprecision for modern C++☆312Updated 4 months ago
- std::simd for GCC [ISO/IEC TS 19570:2018]☆608Updated 2 years ago
- A compile-time linear algebra system for C++☆120Updated 3 years ago
- Reference Implementation for stdBLAS☆137Updated this week
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆441Updated 5 months ago
- Blazing-fast Expression Templates Library (ETL) with GPU support, in C++☆222Updated last year
- Reference implementation of mdspan targeting C++23☆451Updated this week
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal - all it takes to sum a lot of numbers fast!☆96Updated last month
- Open Source Parallel STL implementation☆525Updated last year