Geolm / simd_bitonicLinks
Bitonic sort using simd (avx/neon) instructions
☆17Updated 3 years ago
Alternatives and similar repositories for simd_bitonic
Users that are interested in simd_bitonic are comparing it to the libraries listed below
Sorting:
- GPU B-Tree with support for versioning (snapshots).☆51Updated last year
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆57Updated 3 years ago
- C++ interfaces for RDMA access☆83Updated 2 weeks ago
- AVX512F and AVX2 versions of quick sort☆104Updated 8 years ago
- Parallel Balanced Binary Tree Structures☆121Updated 9 months ago
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload.☆129Updated last year
- Code and results for our paper "Analyzing Vectorized Hash Tables Across CPU Architectures" @ VLDB '23.☆27Updated last year
- Library for lock-free locks☆83Updated 2 years ago
- User-space Page Management☆111Updated last year
- A lock-free priority queue implementation☆35Updated 7 years ago
- ☆56Updated last year
- C++ bindings & containers for libpmemobj☆110Updated 2 years ago
- A GPU accelerated error-bounded lossy compression for scientific data.☆94Updated last week
- GPUfs - File system support for NVIDIA GPUs☆99Updated 7 years ago
- Intel® Query Processing Library (Intel® QPL)☆106Updated 3 weeks ago
- GPU-Accelerated Lossless Data Compressors Survey☆121Updated 5 years ago
- Ocolos is the first open-sourced online code layout optimization system for unmodified applications written in unmanaged languages.☆53Updated last week
- Montage is a system for building fast buffered persistent data structures on nonvolatile memory.☆16Updated 3 years ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆69Updated last year
- Packed Memory Array☆17Updated 11 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆59Updated 12 years ago
- Parallel Memory Bandwidth Measurement / Benchmark Tool☆115Updated 3 years ago
- An easy-to-use, header-only C++ wrapper for Linux' perf event API☆137Updated this week
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs☆75Updated last month
- The Farm-SVE package provides a header that implements the ARM C language extensions (ACLE) for the ARM Scalable Vector Extension (SVE) i…☆14Updated last year
- A Toolkit for Programming Parallel Algorithms on Shared-Memory Multicore Machines☆399Updated last month
- The Berkeley Container Library☆126Updated 3 weeks ago
- a CUDA implementation of a priority queue☆84Updated 5 years ago
- Benchmarking suite for Google workloads☆138Updated last week
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆85Updated 2 years ago