BodhiHu / L-MulLinks
C implementation of the L-Mul f32/f16 multiplications from paper: https://arxiv.org/html/2410.00907
☆27Updated 8 months ago
Alternatives and similar repositories for L-Mul
Users that are interested in L-Mul are comparing it to the libraries listed below
Sorting:
- A fast implementation of log() and exp()☆53Updated 2 years ago
- C23 Checked Arithmetic☆133Updated 6 months ago
- A header only library implementing common mathematical functions using SIMD intrinsics☆108Updated 4 months ago
- Can I make an *optimizing* compiler under 1k lines of code?☆60Updated 4 months ago
- Fast vectorized (SSE 4.1) range coder for 8-bit alphabets☆25Updated 2 years ago
- A collection of some lockfree datastructures☆64Updated 2 years ago
- Rutgers APL correctly rounded math library☆29Updated 4 years ago
- Source for the OpenCilk runtime system, based on Cheetah.☆22Updated 2 weeks ago
- A combined repository for all RLIBM prototypes☆45Updated 8 months ago
- Work Stealing Threadpool in a C Header☆23Updated 11 months ago
- SIMD accelerated method to get the average color of an RGBA8 image☆48Updated 5 months ago
- A very fast and robust 64-bit PRNG with a minimum 2^64 period.☆142Updated last week
- A rethinking of the C time library☆11Updated 4 months ago
- 8-bit Xor Filter in C99☆61Updated 5 years ago
- A floating point arithmetic which works with types of any mantissa, exponent or base in modern header-only C++.☆81Updated 8 months ago
- A header-only portability and boilerplate library for C☆22Updated last year
- A tagged-pointer type for C++.☆34Updated last year
- The little FFT library☆17Updated 10 months ago
- ☆35Updated 4 months ago
- High Level Algorithmic Skeleton CUDA Library☆29Updated last year
- ☆18Updated 11 months ago
- Compiling C to FlipJump☆90Updated 5 months ago
- ☆10Updated 4 years ago
- Yet another simple header only arena allocator for C11☆42Updated 11 months ago
- A header-only C++ library for writing compiler/interpreter frontends.☆14Updated this week
- Wyrm is a GCC GIMPLE to LLVM IR transpiler☆55Updated last year
- ☆296Updated last year
- ☆41Updated 2 years ago
- Eliminating the need for hand-crafted assembly in high-performance interpreters☆14Updated 4 years ago
- Implementation of destination-driven code generation with control destinations. See [post.md](post.md)☆24Updated 7 months ago