Wunkolo / qreverse
A small study in hardware accelerated AoS reversal
☆174Updated 6 years ago
Alternatives and similar repositories for qreverse:
Users that are interested in qreverse are comparing it to the libraries listed below
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated 2 years ago
- A simple, extensible, portable, efficient and header-only SIMD library!☆230Updated 3 years ago
- uops.info Code Analyzer☆256Updated last year
- Optimized CppSPMD test project: macro control flow, SSE4.1/AVX1/AVX2/AVX2 FMA support☆116Updated 4 years ago
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆121Updated 5 years ago
- A fast alternative to the modulo reduction☆308Updated 3 years ago
- Storage for my snippets, toy programs, etc.☆349Updated 2 weeks ago
- A C/C++ header file for fast 32-bit division remainders (and divisibility tests) on 64-bit hardware.☆307Updated 3 months ago
- ZP7: Zach's Peppy Parallel-Prefix-Popcountin' PEXT/PDEP Polyfill☆49Updated 6 months ago
- bad_alloc Behaving Badly☆74Updated 5 years ago
- Heap Layers: An Extensible Memory Allocation Infrastructure☆389Updated 6 months ago
- Microbenchmarking for Modern C++☆218Updated 4 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆57Updated 2 years ago
- Clang with JIT extensions☆228Updated 2 years ago
- Fast random number generators: Vectorized (SIMD) version of xorshift128+☆113Updated 4 years ago
- ☆75Updated 2 years ago
- A spicy text library for C++ that has the explicit goal of enabling the entire ecosystem to share in proper forward progress towards a br…☆317Updated 5 months ago
- CPU Ultimate Latency Test.☆107Updated last year
- In-place Parallel Super Scalar Samplesort (IPS⁴o)☆117Updated last month
- This repository contains high-performance implementations of memset and memcpy in assembly.☆318Updated 3 years ago
- Sample implementation of C++20 atomic_wait/notify☆59Updated 5 years ago
- ☆114Updated 5 years ago
- C library to remove white space from strings as fast as possible☆152Updated 5 months ago
- Policy Based C++ Allocator Library☆123Updated 7 years ago
- Reworking of Agner Fog's performance test programs for Linux☆110Updated 5 years ago
- Fastest CPU SIMD (SSE4) sorting networks for small integer arrays (2-6 elements), also optimal amd64 assembly and notes on getting compil…☆45Updated 3 years ago
- Portable C++ SIMD library☆174Updated 5 years ago
- Create man pages from information used by Intel Intrinsics Guide and optionally uops.info☆45Updated 2 months ago
- Reference implementation of Grisu-Exact in C++☆62Updated 4 years ago