oprecomp / FloatX
Header-only C++ library for low precision floating point type emulation.
☆163Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for FloatX
- High-level C++ for Accelerator Clusters☆142Updated this week
- A micro microbenchmarking library for C++11 in a single header file☆210Updated 8 months ago
- A simple, extensible, portable, efficient and header-only SIMD library!☆229Updated 3 years ago
- The Berkeley Container Library☆120Updated last year
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template…☆353Updated 3 months ago
- Microbenchmarking for Modern C++☆211Updated 4 years ago
- C++ zero-cost abstraction for SoA/AoS memory layouts☆184Updated 2 years ago
- Mirror of the Cephes C source for reference☆86Updated 10 months ago
- Agenium Scale vectorization library for CPUs and GPUs☆326Updated 3 years ago
- Concurrent CPU-GPU Programming using Task Models☆100Updated 4 years ago
- CUDA kernel author's tools☆107Updated 2 years ago
- Programming Accelerators with C++ (PACXX)☆58Updated 6 years ago
- C++11 metaprogramming library☆243Updated this week
- Caliper is an instrumentation and performance profiling library☆350Updated last week
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆64Updated 4 years ago
- std::simd for GCC [ISO/IEC TS 19570:2018]☆578Updated last year
- Blazing-fast Expression Templates Library (ETL) with GPU support, in C++☆218Updated 10 months ago
- ☆68Updated 4 years ago
- UME::SIMD A library for explicit simd vectorization.☆90Updated 6 years ago
- Profiling Taskflow Programs through Visualization☆47Updated last year
- Reference implementation of mdspan targeting C++23☆406Updated last month
- Reference Implementation for stdBLAS☆128Updated last week
- The x template library☆207Updated 5 months ago
- std::bitset with constexpr implementations plus additional features.☆117Updated last year
- Abstraction Library for Parallel Kernel Acceleration☆356Updated this week
- A fast work-stealing queue template in C++☆293Updated 9 months ago
- A simple framework for compile-time benchmarks☆181Updated 3 years ago
- AVX-optimized sin(), cos(), exp() and log() functions☆113Updated 2 years ago
- An implementation of BLAS using the SYCL open standard.☆259Updated last week
- low-level library for minimizing the size of your types☆110Updated 5 years ago