gary-funck / parallel-merge-sort
Shared Memory, Message Passing, and Hybrid Merge Sort: UPC, OpenMP, MPI and Hybrid Implementations
☆14Updated last year
Alternatives and similar repositories for parallel-merge-sort:
Users that are interested in parallel-merge-sort are comparing it to the libraries listed below
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank☆64Updated 9 years ago
- GPU B-Tree with support for versioning (snapshots).☆47Updated 5 months ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆45Updated 5 years ago
- AVX512F and AVX2 versions of quick sort☆105Updated 7 years ago
- ☆10Updated 2 months ago
- Test if AVX vector loads and stores are atomic☆29Updated 4 years ago
- Non-blocking message passing (a C++14 MPI wrapper)☆18Updated 10 years ago
- Vectorized intersections (research code)☆15Updated 8 years ago
- Easily batch-build cmake projects!☆17Updated last month
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆71Updated 4 years ago
- mallocMC: Memory Allocator for Many Core Architectures☆55Updated last week
- A benchmark for cache efficient data structures.☆30Updated 6 years ago
- Simplified Interface to Complex Memory☆28Updated last year
- SIMD-enabled descriptive statistics (mean, variance, covariance, correlation)☆18Updated this week
- C++ Library for Object-oriented Programming with Structure of Arrays Layout☆21Updated 6 years ago
- Compute morton keys using a look-up table generated at compile-time.☆31Updated 8 years ago
- For details, see the blog post:☆32Updated last year
- Collection of Agner Fog Software☆38Updated 6 years ago
- C++ implementation of concurrent Binary Search Trees☆72Updated 9 years ago
- ☆31Updated 3 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆55Updated 2 years ago
- Sample program for article "SIMD-ized searching in unique constant dictionary" (http://0x80.pl/articles/simd-search.html)☆52Updated 7 years ago
- String map implementation through Fast Succinct Trie☆21Updated 3 years ago
- Really fast C++ hash table☆14Updated 7 years ago
- Experimental ranges for CUDA☆24Updated 6 years ago
- A fast and highly scalable GPU dynamic memory allocator☆104Updated 10 years ago
- Evaluating different memory managers for dynamic GPU memory☆25Updated 4 years ago
- A set of benchmarking tools used to quantify the performance of BDE-style polymorphic allocators.☆33Updated 9 years ago
- A more Pragmatic Implementation of the Lock-free, Ordered, Linked List☆17Updated 4 years ago
- Allocation benchmarks☆30Updated 8 years ago