zwegner / zp7Links
ZP7: Zach's Peppy Parallel-Prefix-Popcountin' PEXT/PDEP Polyfill
☆53Updated 11 months ago
Alternatives and similar repositories for zp7
Users that are interested in zp7 are comparing it to the libraries listed below
Sorting:
- Fast random number generators: Vectorized (SIMD) version of xorshift128+☆117Updated 5 years ago
- ☆39Updated 4 years ago
- Radix sorting from the ground up☆37Updated last year
- Pruning elements in SIMD vectors (i.e., packing left elements)☆65Updated last year
- Fast Hash Functions Using AES Intrinsics☆82Updated 5 years ago
- Random Number Generator based on hardware-accelerated AES instructions☆59Updated 6 years ago
- UMASH: a fast enough hash and fingerprint with collision bounds☆157Updated 6 months ago
- Storage for my snippets, toy programs, etc.☆361Updated 4 months ago
- InstLatX64_Demo☆44Updated 2 months ago
- Testing framework for the quest to find a fast & strong mixer, e. g for hashtables.☆40Updated 4 years ago
- uops.info Code Analyzer☆280Updated last year
- A fast alternative to the modulo reduction☆311Updated 4 years ago
- A C/C++ header file for fast 32-bit division remainders (and divisibility tests) on 64-bit hardware.☆323Updated 8 months ago
- A benchmark for sorting algorithms☆61Updated 4 years ago
- LZ77/LZSS designed for SSE based decompression☆142Updated 6 years ago
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆135Updated 5 years ago
- Vectorized version of the PCG random number generator☆80Updated 4 months ago
- Testing common random-number generators (RNG)☆186Updated 2 years ago
- The libsais library provides fast linear-time construction of suffix array (SA), generalized suffix array (GSA), longest common prefix (L…☆215Updated last month
- A C library for runtime-flippable feature flags on Linux/x86-64, with negligible overhead in the common case☆73Updated 2 years ago
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated 2 years ago
- In-place Parallel Super Scalar Samplesort (IPS⁴o)☆121Updated 6 months ago
- Fast CRC32 implementations☆83Updated last month
- 8-bit Xor Filter in C99☆61Updated 5 years ago
- Different algorithms for converting binary to decimal floating-point numbers☆121Updated 2 years ago
- Fastest Histogram Construction☆70Updated 3 years ago
- SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification☆258Updated 3 years ago
- My very own vxsort re-implemented with "modern" C++ by a complete idiot (in C++)☆30Updated last year
- RLibm for 32-bit representations (float and posit32)☆61Updated 3 years ago
- Intriman is a documentation generator that retargets the Intel Intrinsics Guide to other documentation formats☆28Updated 3 years ago