komrad36 / SortingNetworksLinks
Fastest CPU SIMD (SSE4) sorting networks for small integer arrays (2-6 elements), also optimal amd64 assembly and notes on getting compilers to generate optimal sorting networks.
☆52Updated 4 years ago
Alternatives and similar repositories for SortingNetworks
Users that are interested in SortingNetworks are comparing it to the libraries listed below
Sorting:
- Looking into the performance of heaps, starting with the Min-Max Heap☆69Updated 5 years ago
- A hash table implementation using Robin Hood Linear Probing☆52Updated 11 years ago
- Random Number Generator based on hardware-accelerated AES instructions☆61Updated 6 years ago
- Generic SIMD intrinsic to allow for portable SIMD intrinsic programming☆42Updated 11 years ago
- Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)☆170Updated 11 months ago
- Benchmarks for our memory allocator☆70Updated 2 years ago
- Comparing linear and binary searches☆44Updated 5 years ago
- Vectorized version of the PCG random number generator☆87Updated 3 weeks ago
- ☆113Updated 2 years ago
- Fast random number generators: Vectorized (SIMD) version of xorshift128+☆122Updated 5 years ago
- Testing framework for the quest to find a fast & strong mixer, e. g for hashtables.☆40Updated 5 years ago
- A benchmark for sorting algorithms☆63Updated 4 years ago
- Unstable O(n log n) sorting algorithm with O(n) memory☆67Updated 3 months ago
- For details, see the blog post:☆35Updated 2 years ago
- TLSF: two-level segregated fit O(1) allocator☆83Updated 3 years ago
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆140Updated 3 months ago
- Pruning elements in SIMD vectors (i.e., packing left elements)☆68Updated 2 years ago
- Fast Hash Functions Using AES Intrinsics☆85Updated 6 years ago
- ☆39Updated 5 years ago
- Fast multi-threaded memory allocator☆79Updated 6 years ago
- Sample program for article "SIMD-ized searching in unique constant dictionary" (http://0x80.pl/articles/simd-search.html)☆52Updated 8 years ago
- A Fast, Multicore-Scalable, Low-Fragmentation Memory Allocator☆162Updated 7 months ago
- Bitset Sort, a faster std::sort replacement.☆79Updated 4 years ago
- Mutexes that only use two bits. Either stored in a byte, or in the bottom bits of a pointer☆26Updated 3 years ago
- Bloom filter alternative (C++)☆18Updated 7 years ago
- Fast differential coding functions (using SIMD instructions)☆55Updated 8 years ago
- Fastest Histogram Construction☆75Updated 4 years ago
- Coroutines/Fibers implementation for x86☆65Updated 9 years ago
- SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification☆261Updated 4 years ago
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated 3 years ago