hcho3 / skiplist_cudaLinks
A parallel (CUDA) implementation of skiplist
☆15Updated 6 years ago
Alternatives and similar repositories for skiplist_cuda
Users that are interested in skiplist_cuda are comparing it to the libraries listed below
Sorting:
- AVX512F and AVX2 versions of quick sort☆104Updated 8 years ago
- GPU B-Tree with support for versioning (snapshots).☆51Updated last year
- Pruning elements in SIMD vectors (i.e., packing left elements)☆65Updated last year
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank☆67Updated 10 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆57Updated 3 years ago
- ☆39Updated 5 years ago
- testbed for different SIMD implementations for set intersection and set union☆41Updated 5 years ago
- Parallel Balanced Binary Tree Structures☆121Updated 9 months ago
- Themis MapReduce and TritonSort☆11Updated 8 years ago
- Fast AVX512 (AVX-512) quicksort + bitonic sort.☆28Updated 3 years ago
- ☆55Updated last year
- Sample program for article "SIMD-ized searching in unique constant dictionary" (http://0x80.pl/articles/simd-search.html)☆52Updated 8 years ago
- Daichi Amagata and Takahiro Hara, SIGMOD2021☆15Updated last year
- Code and results for our paper "Analyzing Vectorized Hash Tables Across CPU Architectures" @ VLDB '23.☆27Updated last year
- ☆12Updated 4 years ago
- SnailTrail implementation☆40Updated 6 years ago
- Concurrent Log-Structured Memory for Many-Core Key-Value Stores☆36Updated 5 years ago
- GPU-Accelerated Faster Decoding of Integer Lists☆13Updated 6 years ago
- Cycle-level, trace-driven, parallel GPU simulator for NVIDIA Pascal.☆15Updated last week
- A fully concurrent lock-free cuckoo hash table implementation☆56Updated 8 years ago
- ☆15Updated 6 years ago
- ssmem is a simple object-based memory allocator with epoch-based garbage collection☆34Updated 9 years ago
- SIMD recipes, for various platforms (collection of code snippets)☆49Updated 4 years ago
- Scalable Structural Index Constructor for JSON Analytics☆26Updated last year
- ☆22Updated 4 years ago
- A fast and highly scalable GPU dynamic memory allocator☆110Updated 10 years ago
- Unit benchmarks of CUDA event APIs.☆17Updated last year
- Fast and highly tuned bit vector implementation including space efficient rank and select support having only 3.51% space overhead.☆34Updated 8 months ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆46Updated 6 years ago
- Detailed experiments over simdjson for VLDB article (2019) and future work☆21Updated 5 years ago