Geolm / simd_bitonicLinks
Bitonic sort using simd (avx/neon) instructions
☆17Updated 3 years ago
Alternatives and similar repositories for simd_bitonic
Users that are interested in simd_bitonic are comparing it to the libraries listed below
Sorting:
- GPU B-Tree with support for versioning (snapshots).☆51Updated last year
- AVX512F and AVX2 versions of quick sort☆104Updated 8 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆58Updated 3 years ago
- C++ interfaces for RDMA access☆83Updated last month
- GPU-Accelerated Lossless Data Compressors Survey☆122Updated 5 years ago
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload.☆130Updated 2 years ago
- Library for lock-free locks☆83Updated 2 years ago
- Intel® Query Processing Library (Intel® QPL)☆106Updated last month
- Code and results for our paper "Analyzing Vectorized Hash Tables Across CPU Architectures" @ VLDB '23.☆28Updated last year
- ☆57Updated last year
- An easy-to-use, header-only C++ wrapper for Linux' perf event API☆137Updated 3 weeks ago
- Universal Presentation: A Header-only C++ Library to Cout STL containers and more☆18Updated 2 years ago
- Code of the paper "Building an Efficient Key-Value Store in a Flexible Address Space", EuroSys '22☆21Updated 10 months ago
- A Toolkit for Programming Parallel Algorithms on Shared-Memory Multicore Machines☆418Updated 2 months ago
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs☆76Updated last month
- Radix sorting from the ground up☆37Updated 2 years ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆70Updated last year
- A lock-free priority queue implementation☆35Updated 7 years ago
- Parallel Balanced Binary Tree Structures☆122Updated 10 months ago
- A GPU accelerated error-bounded lossy compression for scientific data.☆94Updated 3 weeks ago
- LANL no longer develops PLFS. Feel free to fork and develop as you wish.☆42Updated 11 years ago
- A High-Throughput Parallel Lossless Compressor for Scientific Data☆75Updated 3 years ago
- Packed Memory Array☆17Updated 11 years ago
- Boki: Stateful Serverless Computing with Shared Logs [SOSP '21]☆84Updated 3 years ago
- Montage is a system for building fast buffered persistent data structures on nonvolatile memory.☆16Updated 3 years ago
- C++ bindings & containers for libpmemobj☆110Updated 2 years ago
- Ocolos is the first open-sourced online code layout optimization system for unmodified applications written in unmanaged languages.☆53Updated 3 weeks ago
- Example code for Intel AVX / AVX2 intrinsics.☆144Updated 2 years ago
- a CUDA implementation of a priority queue☆84Updated 5 years ago
- The Berkeley Container Library☆126Updated last month