0xBYTESHIFT / fp16
class that represents 16-bit floating point (half)
☆11Updated last year
Alternatives and similar repositories for fp16:
Users who are interested in fp16 are comparing it to the libraries listed below.
- A simple, fast, minimalistic header-only library for running async tasks and executing task graphs.☆51Updated 2 months ago
- A C++ neural network library for machine learning☆14Updated 9 months ago
- An Open Convolutional Neural Network Framework in C++ From Scratch☆60Updated 3 years ago
- Common libraries for PPL projects☆29Updated 4 months ago
- How to design a CPU GEMM on x86 with 256-bit AVX that can beat OpenBLAS.☆67Updated 5 years ago
- Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆43Updated 3 months ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆23Updated last year
- ☆19Updated 3 years ago
- PyTorch -> ONNX -> TVM for autotuning☆23Updated 4 years ago
- TSDG: An efficient index graph for graph-based nearest neighbor search☆9Updated 2 years ago
- Automatically exported from code.google.com/p/math-neon☆39Updated 9 years ago
- C++ fast hierarchical clustering algorithms☆85Updated last year
- Header-only safetensors loader and saver in C++☆53Updated 2 months ago
- C++ lock-free queue.☆12Updated 4 years ago
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆17Updated 2 years ago
- High-Performance Computing: CPU Instructions, GPU OpenCL & CUDA, etc.☆14Updated 9 months ago
- A pure C++ implementation of the lowess algorithm using templates☆21Updated 9 years ago
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line☆21Updated 2 weeks ago
- symmetric int8 gemm☆66Updated 4 years ago
- A header only library implementing common mathematical functions using SIMD intrinsics☆97Updated this week
- A C++ header-only library for data transfer between linear algebra libraries (Eigen, Armadillo, OpenCV, ArrayFire, LibTorch).☆81Updated 9 months ago
- Convert ONNX models to plain C++ code (without dependencies)☆19Updated last year
- A demonstration of speeding up a 1D convolution using SSE☆50Updated 8 years ago
- ☆32Updated 6 months ago
- C++ header-only lib for extracting local patches☆15Updated 4 years ago
- 📦 TCP based publish-subscribe library for C++ 🌐☆41Updated last week
- C99/C++ header-only library for division via fixed-point multiplication by inverse☆49Updated 10 months ago
- Bilinear interpolation using SIMD☆23Updated 3 years ago
- flexible-gemm conv of deepcore☆17Updated 5 years ago
- IEEE 754-based C++ half-precision floating point library forked from http://half.sourceforge.net☆23Updated 3 years ago