BodhiHu / L-Mul
C implementation of the L-Mul f32/f16 multiplications from paper: https://arxiv.org/html/2410.00907
☆27Updated 5 months ago
Alternatives and similar repositories for L-Mul:
Users that are interested in L-Mul are comparing it to the libraries listed below
- C23 Checked Arithmetic☆122Updated 3 months ago
- A fast implementation of log() and exp()☆53Updated 2 years ago
- Wyrm is a GCC GIMPLE to LLVM IR transpiler☆55Updated last year
- A tagged-pointer type for C++.☆31Updated last year
- A collection of some lockfree datastructures☆59Updated last year
- A header-only C++ library for writing compiler/interpreter frontends.☆14Updated 2 weeks ago
- SIMD accelerated method to get the average color of an RGBA8 image☆49Updated 2 months ago
- Can I make an *optimizing* compiler under 1k lines of code?☆55Updated last month
- Rutgers APL correctly rounded math library☆29Updated 4 years ago
- A fast, zero dependency, single-header WebAssembly interpreter☆35Updated last year
- A header-only portability and boilerplate library for C☆22Updated last year
- Modeling futexes in TLA+☆21Updated 5 months ago
- Fast vectorized (SSE 4.1) range coder for 8-bit alphabets☆25Updated last year
- Implementation of destination-driven code generation with control destinations. See [post.md](post.md)☆24Updated 4 months ago
- Bytecode interpreter☆72Updated 2 months ago
- A combined repository for all RLIBM prototypes☆45Updated 6 months ago
- A floating point arithmetic which works with types of any mantissa, exponent or base in modern header-only C++.☆80Updated 5 months ago
- Small, easy-to-integrate shader compiler written in C99. Compiles HLSL to SPIR-V☆49Updated 4 years ago
- xxHash Cleaner C Reference Implementation☆43Updated 4 years ago
- A tiny CPU rasterization engine accompanying a tutorial series on writing a CPU rasterizer☆84Updated 4 months ago
- Tiny optimizing JIT compiler backend.☆45Updated 2 months ago
- Optimised x86-64 gzip decompressor☆29Updated 7 years ago
- InstLatX64_Demo☆42Updated last month
- The little FFT library☆15Updated 8 months ago
- moderngpu algorithms for C++ shaders☆16Updated 4 years ago
- Info on enabling AVX-512 on Alder Lake☆42Updated 2 years ago
- ☆174Updated last year
- A collection of Fast Fourier Transform algorithms implemented in C++20.☆111Updated last year
- Batched random number generation☆16Updated 3 months ago
- Soft floating point☆14Updated 2 weeks ago