Wunkolo / qreverseLinks
A small study in hardware accelerated AoS reversal
☆175Updated 6 years ago
Alternatives and similar repositories for qreverse
Users that are interested in qreverse are comparing it to the libraries listed below
Sorting:
- Storage for my snippets, toy programs, etc.☆362Updated 4 months ago
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated 2 years ago
- A simple, extensible, portable, efficient and header-only SIMD library!☆230Updated 3 years ago
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆135Updated 5 years ago
- SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification☆259Updated 3 years ago
- A C/C++ header file for fast 32-bit division remainders (and divisibility tests) on 64-bit hardware.☆323Updated 8 months ago
- uops.info Code Analyzer☆281Updated last year
- Heap Layers: An Extensible Memory Allocation Infrastructure☆401Updated last month
- bad_alloc Behaving Badly☆74Updated 6 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆56Updated 2 years ago
- Fast random number generators: Vectorized (SIMD) version of xorshift128+☆117Updated 5 years ago
- POSIX equivalent of Windows DLL import libraries☆258Updated 4 months ago
- Object Introspection (OI) enables on-demand, hierarchical profiling of objects in arbitrary C/C++ programs with no recompilation.☆176Updated this week
- Clang with JIT extensions☆232Updated 2 years ago
- Microbenchmarking for Modern C++☆225Updated 4 years ago
- This repository contains high-performance implementations of memset and memcpy in assembly.☆331Updated 3 years ago
- Reworking of Agner Fog's performance test programs for Linux☆113Updated 6 years ago
- Reference implementation of Grisu-Exact in C++☆67Updated 4 years ago
- Optimized CppSPMD test project: macro control flow, SSE4.1/AVX1/AVX2/AVX2 FMA support☆119Updated 4 years ago
- Measuring cmov vs branch-mov performance☆89Updated 7 years ago
- A fast, small C/C++ function call tracer for x86-64/Linux, supports clang & gcc, ftrace, threads, exceptions & shared libraries☆178Updated 4 months ago
- A cross-platform C function to get the cache line size (in bytes) of the processor, or 0 on failure☆123Updated 3 years ago
- ZP7: Zach's Peppy Parallel-Prefix-Popcountin' PEXT/PDEP Polyfill☆54Updated 11 months ago
- X86 CPU topics overview for developers , oriented towards performance☆200Updated 5 months ago
- Forward, no matter what.☆132Updated 3 years ago
- SIMD (SSE) string functions☆102Updated 8 years ago
- Concurrent Deferred Reference Counting☆165Updated last year
- User-oriented fork of LLVM's opt-viewer☆143Updated 3 weeks ago
- Portable C++ SIMD library☆173Updated 5 years ago
- Prints values and types during compilation!☆58Updated last month