mis-wut / feathergpu
☆15 Updated 8 years ago
Alternatives and similar repositories for feathergpu
Users that are interested in feathergpu are comparing it to the libraries listed below
- Quick sort code using AVX2 instructions ☆69 Updated 8 years ago
- AVX512F and AVX2 versions of quick sort ☆104 Updated 7 years ago
- Programmatically obtain information about the pages backing a given memory region ☆79 Updated 3 years ago
- Sample program for article "SIMD-ized searching in unique constant dictionary" (http://0x80.pl/articles/simd-search.html) ☆52 Updated 8 years ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations ☆45 Updated 5 years ago
- Pruning elements in SIMD vectors (i.e., packing left elements) ☆65 Updated last year
- A Scalable, Portable, and Memory-Efficient Lock-Free FIFO Queue (DISC '19) ☆61 Updated last year
- Random Number Generator based on hardware-accelerated AES instructions ☆60 Updated 6 years ago
- Fast Hash Functions Using AES Intrinsics ☆82 Updated 6 years ago
- Information about AVX-512 support on recent Intel processors ☆45 Updated 3 years ago
- Benchmarks for our memory allocator ☆69 Updated last year
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so… ☆57 Updated 2 years ago
- ☆109 Updated last year
- CLI utility to work out proper constants for vpternlogic instruction ☆13 Updated 2 years ago
- Comparing linear and binary searches ☆41 Updated 5 years ago
- SIMD recipes, for various platforms (collection of code snippets) ☆49 Updated 4 years ago
- Fastest Histogram Construction ☆70 Updated 3 years ago
- Fast C functions for computing the positional popcount (pospopcnt). ☆55 Updated 5 years ago
- Generic SIMD intrinsic to allow for portable SIMD intrinsic programming ☆41 Updated 11 years ago
- ☆39 Updated 4 years ago
- Vectorized version of the PCG random number generator ☆83 Updated 5 months ago
- Lock-free multithreaded memory allocation ☆108 Updated 8 years ago
- SIMDized check which bytes are in a set ☆28 Updated 6 years ago
- Simple C++ code to benchmark fast division algorithms ☆52 Updated 4 years ago
- LLVM-jitted bitmaps ☆27 Updated 5 years ago
- A hash table implementation using Robin Hood Linear Probing ☆51 Updated 11 years ago
- 8-bit Xor Filter in C99 ☆61 Updated 5 years ago
- ssmem is a simple object-based memory allocator with epoch-based garbage collection ☆34 Updated 9 years ago
- Radix sorting from the ground up ☆37 Updated last year
- A fast implementation of log() and exp() ☆52 Updated 2 years ago