Geolm / simd_bitonic
Bitonic sort using simd (avx/neon) instructions
☆14Updated 3 years ago
Alternatives and similar repositories for simd_bitonic:
Users that are interested in simd_bitonic are comparing it to the libraries listed below
- GPU B-Tree with support for versioning (snapshots).☆47Updated 5 months ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆55Updated 2 years ago
- AVX512F and AVX2 versions of quick sort☆105Updated 7 years ago
- Code and results for our paper "Analyzing Vectorized Hash Tables Across CPU Architectures" @ VLDB '23.☆24Updated last year
- C++ interfaces for RDMA access☆68Updated 2 weeks ago
- ☆53Updated 10 months ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆59Updated 11 years ago
- GPU-Accelerated Lossless Data Compressors Survey☆114Updated 4 years ago
- A GPU-Accelerated In-Memory Key-Value Store (AWS-focused fork)☆28Updated 7 years ago
- Library for lock-free locks☆77Updated last year
- The Farm-SVE package provides a header that implements the ARM C language extensions (ACLE) for the ARM Scalable Vector Extension (SVE) i…☆14Updated last year
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs☆74Updated 2 years ago
- ☆16Updated 10 months ago
- Radix sorting from the ground up☆36Updated last year
- A lock-free priority queue implementation☆34Updated 6 years ago
- TLB Benchmarks☆33Updated 7 years ago
- Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.☆52Updated last year
- ☆20Updated 2 years ago
- ☆29Updated this week
- JSONPath Streaming with Bit-Parallel Fast-Forwarding☆25Updated 5 months ago
- Flash Perfect Hash Table: an implementation of a dynamic perfect hash table, extremely fast for lookup☆42Updated last year
- Parallel Balanced Binary Tree Structures☆114Updated 2 weeks ago
- ☆17Updated 3 months ago
- Optimistic queue-based reader-writer lock for robust index synchronization (SIGMOD 2024)☆24Updated 10 months ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆45Updated 5 years ago
- ☆34Updated 3 years ago
- A low level, low latency library, which can be used to accelerate network messages using shared memory and RDMA☆75Updated 4 years ago
- testbed for different SIMD implementations for set intersection and set union☆41Updated 5 years ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆57Updated 5 months ago
- ☆12Updated last year