0xBYTESHIFT / fp16
class that represents 16-bit floating point (half)
☆11Updated last year
Alternatives and similar repositories for fp16:
Users that are interested in fp16 are comparing it to the libraries listed below
- A simple and fast minimalistic header-only library allowing to run async tasks and execute task graphs.☆53Updated 4 months ago
- C99/C++ header-only library for division via fixed-point multiplication by inverse☆50Updated last year
- Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆45Updated 5 months ago
- A header only library implementing common mathematical functions using SIMD intrinsics☆103Updated 2 months ago
- C++ Lightweight Utility Extensions☆75Updated 3 years ago
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line☆23Updated this week
- A C++ neural network library for machine learning☆14Updated 11 months ago
- An Open Convolutional Neural Network Framework in C++ From Scratch☆61Updated 4 years ago
- A single file C++17 header-only Minimal Acyclic Subsequential Transducers, or Finite State Transducers☆55Updated 2 years ago
- Header-only safetensors loader and saver in C++☆56Updated last week
- Portable 128-bit SIMD intrinsics☆58Updated last year
- Comparison of C++ Serialization Libraries for Graph Data☆34Updated 3 years ago
- CPP20 implementation of a 16-bit floating-point type mimicking most of the IEEE 754 behavior. Single file and header-only.☆41Updated last year
- C++20 Tensor library☆26Updated 3 months ago
- a single-header math library☆16Updated 6 months ago
- how to design cpu gemm on x86 with avx256, that can beat openblas.☆70Updated 6 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆70Updated 9 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Common libraries for PPL projects☆29Updated last month
- Task System presented in "Better Code: Concurrency - Sean Parent"☆42Updated 4 years ago
- The Farm-SVE package provides a header that implements the ARM C language extensions (ACLE) for the ARM Scalable Vector Extension (SVE) i…☆14Updated last year
- Automatically exported from code.google.com/p/math-neon☆40Updated 10 years ago
- AVX-optimized sin(), cos(), exp() and log() functions☆123Updated 3 years ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆45Updated 5 years ago
- Simple and efficient memory pool is implemented with C++11.☆8Updated 2 years ago
- A modern, C++20-native, single-file header-only dense 2D matrix library.☆87Updated last year
- Conversion to/from half-precision floating point formats☆347Updated 8 months ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆23Updated last year
- Simple example of using Vulkan for GPGPU computing☆53Updated 6 years ago
- SIMD optimised library for matrix inversion of 2x2, 3x3, and 4x4 matrices.☆93Updated 9 years ago