Wunkolo / qreverse
A small study in hardware accelerated AoS reversal
☆173Updated 6 years ago
Alternatives and similar repositories for qreverse:
Users that are interested in qreverse are comparing it to the libraries listed below
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated 2 years ago
- A simple, extensible, portable, efficient and header-only SIMD library!☆229Updated 3 years ago
- A C/C++ header file for fast 32-bit division remainders (and divisibility tests) on 64-bit hardware.☆312Updated 5 months ago
- uops.info Code Analyzer☆267Updated last year
- This repository contains high-performance implementations of memset and memcpy in assembly.☆328Updated 3 years ago
- bad_alloc Behaving Badly☆74Updated 5 years ago
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆133Updated 5 years ago
- Optimized CppSPMD test project: macro control flow, SSE4.1/AVX1/AVX2/AVX2 FMA support☆117Updated 4 years ago
- Storage for my snippets, toy programs, etc.☆353Updated last month
- low-level library for minimizing the size of your types☆113Updated 5 years ago
- Heap Layers: An Extensible Memory Allocation Infrastructure☆394Updated 2 weeks ago
- Clang with JIT extensions☆229Updated 2 years ago
- Fast random number generators: Vectorized (SIMD) version of xorshift128+☆115Updated 4 years ago
- Microbenchmarking for Modern C++☆219Updated 4 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆57Updated 2 years ago
- Fast multi-threaded memory allocator☆78Updated 5 years ago
- Reworking of Agner Fog's performance test programs for Linux☆110Updated 6 years ago
- Object Introspection (OI) enables on-demand, hierarchical profiling of objects in arbitrary C/C++ programs with no recompilation.☆172Updated this week
- A micro microbenchmarking library for C++11 in a single header file☆217Updated 3 weeks ago
- A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs.☆465Updated last month
- A drop-in replacement for std::list with 293% faster insertion, 57% faster erasure, 17% faster iteration and 77% faster sorting on averag…☆154Updated last month
- A modern interface for implementing bulk-synchronous parallel programs.☆94Updated 2 years ago
- C library implementing the ridiculously fast CLHash hashing function☆277Updated last year
- Code for benchmarking of mutexes to accompany a blog post of mine.☆28Updated 5 years ago
- A fast alternative to the modulo reduction☆309Updated 4 years ago
- Portable C++ SIMD library☆174Updated 5 years ago
- Programatically obtain information about the pages backing a given memory region☆75Updated 3 years ago
- The Berkeley Container Library☆124Updated last year
- Light, fast, threadpool for C++20☆101Updated 2 years ago
- Eliminate all the tedious hassle when making state-of-the-art C++ 14 - 23 libraries!☆168Updated 3 weeks ago