gary-funck / parallel-merge-sort
Shared Memory, Message Passing, and Hybrid Merge Sort: UPC, OpenMP, MPI and Hybrid Implementations
☆14Updated 11 months ago
Alternatives and similar repositories for parallel-merge-sort:
Users that are interested in parallel-merge-sort are comparing it to the libraries listed below
- AVX512F and AVX2 versions of quick sort☆105Updated 7 years ago
- GPU B-Tree with support for versioning (snapshots).☆45Updated 2 months ago
- Sample program for article "SIMD-ized searching in unique constant dictionary" (http://0x80.pl/articles/simd-search.html)☆52Updated 7 years ago
- DSL for SIMD Sorting on AVX2 & AVX512☆29Updated 6 years ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆45Updated 5 years ago
- For details, see the blog post:☆32Updated last year
- mallocMC: Memory Allocator for Many Core Architectures☆53Updated this week
- String map implementation through Fast Succinct Trie☆21Updated 3 years ago
- Optimal partitioning of Variable-Byte codes for better compression and fast decoding.☆17Updated 3 years ago
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank☆63Updated 9 years ago
- Super Scalar Sample Sort in modern C++☆20Updated 4 years ago
- Benchmarks for our memory allocator☆62Updated last year
- Test if AVX vector loads and stores are atomic☆27Updated 4 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆54Updated 2 years ago
- Hash table with separate chaining layout☆9Updated 3 years ago
- UME::SIMD A library for explicit simd vectorization.☆91Updated 7 years ago
- Vectorized intersections (research code)☆14Updated 8 years ago
- A fully concurrent lock-free cuckoo hash table implementation☆56Updated 7 years ago
- ☆31Updated 3 years ago
- testbed for different SIMD implementations for set intersection and set union☆41Updated 4 years ago
- A more Pragmatic Implementation of the Lock-free, Ordered, Linked List☆17Updated 4 years ago
- Fast in-place radix sort with STL-like API☆46Updated 8 years ago
- A C++ container-like data structure for storing a vector of bits with fast appending on both sides and fast insertion in the middle, all …☆22Updated 2 years ago
- A Benchmark of Minimal Perfect Hash Function Algorithms.☆35Updated 2 years ago
- Giddy - A lightweight GPU decompression library☆42Updated 5 years ago
- Parallel C++ algorithms☆58Updated 8 months ago
- An exploratory implementation of bit overloads of the C++ standard algorithms☆18Updated 3 years ago
- CCProf: Lightweight Detection of Cache Conflicts☆25Updated 3 years ago
- A C++ implementation of cache-oblivious Van Emde Boas search trees☆19Updated 6 years ago
- A comparison study between sequential sorting algorithms implemented in C++ and parallel sorting algorithms implemented in CUDA as part o…☆61Updated 3 years ago