vkrasnov / vpmadd
Multiplication using AVX512 and AVX512IFMA instructions
☆23Updated 9 years ago
Alternatives and similar repositories for vpmadd:
Users that are interested in vpmadd are comparing it to the libraries listed below
- I-cache line packing and branch misprediction measuring tool☆17Updated 8 years ago
- like ChaCha, but 64-bit instead of 32-bit thanks to BLAKE2b's permutation☆15Updated 7 years ago
- A small DFA for under 16 states☆51Updated 6 years ago
- reverse engineering branch predictors☆17Updated 8 years ago
- ssmem is a simple object-based memory allocator with epoch-based garbage collection☆34Updated 8 years ago
- Extended Roofline Model - LLVM source tree with additional libraries for the analysis of the dynamic execution in the interpreter☆17Updated 7 years ago
- Quick sort code using AVX2 instructions☆68Updated 7 years ago
- C library to compute the Hamming weight of arrays☆45Updated 6 years ago
- RLU resizable hash-table☆16Updated 9 years ago
- FLECC_IN_C is a FLexible Elliptic Curve Cryptography library written IN C☆18Updated 7 years ago
- finding set bits in large bitmaps☆15Updated 9 years ago
- Wren: Nonblocking Reads in a Partitioned Transactional Causally Consistent Data Store☆8Updated 6 years ago
- SIMDized check which bytes are in a set☆28Updated 6 years ago
- SIMD recipes, for various platforms (collection of code snippets)☆48Updated 3 years ago
- code for examining determinism of performance counters☆21Updated 3 years ago
- NetBSD cdb (constant database) library☆14Updated 5 years ago
- Linux kernel source tree with fast swap patches.☆20Updated 11 years ago
- A Parallelism Profiler with What-If analyses for Intel Threading Building Blocks (TBB) programs☆13Updated 7 years ago
- Sample program for article "SIMD-ized searching in unique constant dictionary" (http://0x80.pl/articles/simd-search.html)☆52Updated 7 years ago
- Feed-forward Bloom filters☆52Updated 13 years ago
- Restartable Sequences: a userspace implementation of cheap per-cpu atomic operations☆36Updated 6 years ago
- Compact tries for fixed-width keys☆25Updated 6 years ago
- Vectorized intersections (research code)☆14Updated 8 years ago
- measure entropy of memory allocators☆12Updated 3 years ago
- Wait-Free Eras (PPoPP '20)☆10Updated 5 years ago
- A software-based Ethernet switch design built around a memory-efficient, high-performance, and highly-concurrent hash table for compact a…☆34Updated 9 years ago
- AVX2 Chacha implementation☆16Updated 11 years ago
- Some variations on Lemire's Fast Random Integer Generation in an Interval☆15Updated 5 years ago
- AVX-512 utilities☆19Updated 10 years ago
- RWMutex for sharing of multicore machines.☆17Updated 5 years ago