jonicho / simd-radix-sortLinks
A generic and efficient SIMD implementation of MSB Radix Sort with separate key and payload datastreams that supports arbitrary key and payload data types written in C++ accompanied by a bachelor's thesis.
☆13Updated 5 months ago
Alternatives and similar repositories for simd-radix-sort
Users that are interested in simd-radix-sort are comparing it to the libraries listed below
Sorting:
- CLI utilty to work out proper constants for vpternlogic instruction☆13Updated 2 years ago
- benchmarking positional population count☆15Updated last year
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆56Updated 2 years ago
- AVX implementation of the Skinny block cipher☆11Updated 3 years ago
- Create man pages from information used by Intel Intrinsics Guide and optionally uops.info☆45Updated 7 months ago
- InstLatX64_Demo☆43Updated last month
- Random Number Generator based on hardware-accelerated AES instructions☆59Updated 6 years ago
- ZP7: Zach's Peppy Parallel-Prefix-Popcountin' PEXT/PDEP Polyfill☆53Updated 11 months ago
- Performance Counter Measurements at the cycle granularity☆18Updated 4 years ago
- Benchmarks for our memory allocator☆67Updated last year
- Sample program for article "SIMD-ized searching in unique constant dictionary" (http://0x80.pl/articles/simd-search.html)☆52Updated 8 years ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆45Updated 5 years ago
- Interleaving bits from two sources using SIMD instructions.☆14Updated 7 years ago
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆14Updated 4 years ago
- Markup source code showing optimizations☆36Updated 5 years ago
- A test library for computing modular exponentiation in parallel using AVX-512 vector arithmetic☆12Updated last year
- ☆39Updated 4 years ago
- Fast CRC32 implementations☆80Updated 3 weeks ago
- A lock-free malloc(3) implementation☆19Updated last month
- Quick sort code using AVX2 instructions☆69Updated 8 years ago
- O(n log n) In-place Stable Quicksort☆18Updated 7 months ago
- Extremely fast x86 / AVX2 assembly implementations of common operations for linear arrays: checking whether array contains element, findi…☆38Updated 3 years ago
- unrolled skip list library for C☆21Updated 6 years ago
- Fastest CPU SIMD (SSE4) sorting networks for small integer arrays (2-6 elements), also optimal amd64 assembly and notes on getting compil…☆47Updated 4 years ago
- Testing framework for the quest to find a fast & strong mixer, e. g for hashtables.☆40Updated 4 years ago
- A bit mixer, pseudo random number generator and a hash function.☆28Updated 3 years ago
- Support for ternary logic in SSE, XOP, AVX2 and x86 programs☆31Updated 6 months ago
- AVX-512 utilities☆20Updated 10 years ago
- My very own vxsort re-implemented with "modern" C++ by a complete idiot (in C++)☆30Updated last year
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆135Updated 5 years ago