Wunkolo / qreverse
A small study in hardware accelerated AoS reversal
☆173 Updated 6 years ago
Alternatives and similar repositories for qreverse:
Users who are interested in qreverse are comparing it to the libraries listed below.
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses ☆128 Updated 2 years ago
- Heap Layers: An Extensible Memory Allocation Infrastructure ☆393 Updated this week
- A simple, extensible, portable, efficient and header-only SIMD library! ☆230 Updated 3 years ago
- A C/C++ header file for fast 32-bit division remainders (and divisibility tests) on 64-bit hardware. ☆311 Updated 4 months ago
- Storage for my snippets, toy programs, etc. ☆350 Updated last week
- Reference implementation of Grisu-Exact in C++ ☆61 Updated 4 years ago
- Optimized CppSPMD test project: macro control flow, SSE4.1/AVX1/AVX2/AVX2 FMA support ☆117 Updated 4 years ago
- Very low-overhead timer/counter interfaces for C on Intel 64 processors. ☆129 Updated 5 years ago
- A fast alternative to the modulo reduction ☆309 Updated 4 years ago
- Fast multi-threaded memory allocator ☆79 Updated 5 years ago
- uops.info Code Analyzer ☆262 Updated last year
- Fast random number generators: Vectorized (SIMD) version of xorshift128+ ☆113 Updated 4 years ago
- SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification ☆246 Updated 3 years ago
- User-oriented fork of LLVM's opt-viewer ☆140 Updated 6 months ago
- bad_alloc Behaving Badly ☆74 Updated 5 years ago
- ☆105 Updated last year
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so… ☆57 Updated 2 years ago
- low-level library for minimizing the size of your types ☆113 Updated 5 years ago
- Microbenchmarking for Modern C++ ☆219 Updated 4 years ago
- Measuring cmov vs branch-mov performance ☆85 Updated 7 years ago
- LightweighT Almost Lock-Less Oriented for C++ programs memory allocator ☆166 Updated 6 years ago
- Fastest CPU SIMD (SSE4) sorting networks for small integer arrays (2-6 elements), also optimal amd64 assembly and notes on getting compil… ☆45 Updated 4 years ago
- BitMagic Library ☆422 Updated this week
- 🚀 Fast C/C++ bit population count library ☆339 Updated 9 months ago
- Clang from the Future: A C++17 to C++11 source-to-source compiler ☆124 Updated 6 years ago
- Create man pages from information used by Intel Intrinsics Guide and optionally uops.info ☆45 Updated 3 months ago
- Intriman is a documentation generator that retargets the Intel Intrinsics Guide to other documentation formats ☆28 Updated 2 years ago
- Bit containers, sequences, and views for everyone. 🕷️ ☆122 Updated 2 years ago
- Concurrent Deferred Reference Counting ☆159 Updated last year
- Reference implementation of Dragonbox in C++ ☆638 Updated 5 months ago