yandex / fastopsLinks
This small library enables acceleration of bulk calls of certain math functions on AVX and AVX2 hardware. Currently supported operations are exp, log, sigmoid and tanh. The library is designed with extensibility in mind.
☆87Updated 4 years ago
Alternatives and similar repositories for fastops
Users that are interested in fastops are comparing it to the libraries listed below
Sorting:
- Transpose: SIMD Integer+Floating Point Compression Filter☆63Updated 6 years ago
- C++ library to pack and unpack vectors of integers having a small range of values using a technique called Frame of Reference☆54Updated last year
- Selection and partial sorting algorithms☆138Updated 8 months ago
- Fast random number generators: Vectorized (SIMD) version of xorshift128+☆120Updated 5 years ago
- Pruning elements in SIMD vectors (i.e., packing left elements)☆65Updated last year
- Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)☆170Updated 10 months ago
- SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification☆261Updated 3 years ago
- Looking into the performance of heaps, starting with the Min-Max Heap☆69Updated 4 years ago
- Succinct data structures in C/C++☆92Updated last year
- AVX512F and AVX2 versions of quick sort☆104Updated 8 years ago
- ☆112Updated last year
- Fast differential coding functions (using SIMD instructions)☆55Updated 8 years ago
- For details, see the blog post:☆34Updated 2 years ago
- Testing framework for the quest to find a fast & strong mixer, e. g for hashtables.☆40Updated 5 years ago
- Quick sort code using AVX2 instructions☆69Updated 8 years ago
- C library to pack and unpack short arrays of integers as fast as possible☆91Updated 2 years ago
- Bitset Sort, a faster std::sort replacement.☆78Updated 3 years ago
- Fastest Histogram Construction☆73Updated 3 years ago
- Fast Hash Functions Using AES Intrinsics☆84Updated 6 years ago
- Cache-friendly associative STL-like container with an Eytzinger (BFS) layout for C++☆33Updated 8 years ago
- Fast implementations of the scancount algorithm: C++ header-only library☆27Updated 6 years ago
- Sample program for article "SIMD-ized searching in unique constant dictionary" (http://0x80.pl/articles/simd-search.html)☆52Updated 8 years ago
- Fast decoder for VByte-compressed integers☆125Updated last month
- Compile-time TRIE based string matcher (C++11)☆53Updated 6 years ago
- A fast alternative to the modulo reduction☆314Updated 4 years ago
- SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html☆350Updated last year
- A C/C++ header file for fast 32-bit division remainders (and divisibility tests) on 64-bit hardware.☆338Updated last year
- Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"☆204Updated 6 years ago
- ☆39Updated 4 years ago
- ☆50Updated 5 years ago