Geolm / simd_bitonic
Bitonic sort using SIMD (AVX/NEON) instructions
☆12 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for simd_bitonic
- GPU B-Tree with support for versioning (snapshots). ☆43 · Updated 2 weeks ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019 ☆52 · Updated 2 years ago
- AVX512F and AVX2 versions of quick sort ☆105 · Updated 6 years ago
- Code and results for our paper "Analyzing Vectorized Hash Tables Across CPU Architectures" @ VLDB '23. ☆23 · Updated 9 months ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc… ☆55 · Updated 3 weeks ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations ☆45 · Updated 4 years ago
- A GPU FP32 computation method with Tensor Cores. ☆18 · Updated last year
- Encapsulates frequently used AVX instructions as independent modules to reduce repeated development work. ☆114 · Updated 10 months ago
- Benchmarking positional population count ☆12 · Updated 8 months ago
- A CUDA implementation of a priority queue ☆81 · Updated 4 years ago
- Radix sorting from the ground up ☆35 · Updated 9 months ago
- GPU-Accelerated Lossless Data Compressors Survey ☆110 · Updated 4 years ago
- Repository for the HotOS paper "FIFO can be Better than LRU: the Power of Lazy Promotion and Quick Demotion" ☆32 · Updated last year
- InstLatX64_Demo ☆41 · Updated this week
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs ☆74 · Updated last year
- NUMA-Aware Reader-Writer Locks ☆18 · Updated 10 years ago
- Testbed for different SIMD implementations for set intersection and set union ☆40 · Updated 4 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code). ☆58 · Updated 10 years ago
- A low-level, low-latency library that can be used to accelerate network messages using shared memory and RDMA ☆70 · Updated 3 years ago
- Code for the paper "Building an Efficient Key-Value Store in a Flexible Address Space", EuroSys '22 ☆21 · Updated 5 months ago
- Fast AVX-512 quicksort + bitonic sort. ☆26 · Updated 2 years ago
- Benchmarking tools for pmemkv ☆22 · Updated last year
- My notes on various HPC papers. ☆21 · Updated last year
- A lock-free priority queue implementation ☆31 · Updated 6 years ago