berenger-eu / avx-512-sort
Fast AVX512 (AVX-512) quicksort + bitonic sort.
☆28 Updated 2 years ago
Alternatives and similar repositories for avx-512-sort
Users who are interested in avx-512-sort are comparing it to the libraries listed below.
- InstLatX64_Demo ☆43 Updated last week
- NUMA-Aware Reader-Writer Locks ☆18 Updated 10 years ago
- User-space Page Management ☆107 Updated 9 months ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so… ☆57 Updated 2 years ago
- ☆36 Updated 10 months ago
- A Scalable, Portable, and Memory-Efficient Lock-Free FIFO Queue (DISC '19) ☆57 Updated last year
- AVX512F and AVX2 versions of quick sort ☆105 Updated 7 years ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux. ☆96 Updated last year
- Sample program for article "SIMD-ized searching in unique constant dictionary" (http://0x80.pl/articles/simd-search.html) ☆52 Updated 8 years ago
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) API ☆111 Updated last week
- Testing memory-level parallelism ☆68 Updated last year
- Predator: Predictive False Sharing Detection ☆21 Updated 11 years ago
- ☆56 Updated last month
- CERE: Codelet Extractor and REplayer ☆40 Updated last year
- Intel® Query Processing Library (Intel® QPL) ☆103 Updated last week
- Montage is a system for building fast buffered persistent data structures on nonvolatile memory. ☆15 Updated 2 years ago
- GPUfs - File system support for NVIDIA GPUs ☆93 Updated 6 years ago
- Quick sort code using AVX2 instructions ☆68 Updated 7 years ago
- ✈️ PTHash is a fast and compact minimal perfect hash function. ☆226 Updated this week
- Code used for generating charts and measurements of nontemporal stores ☆9 Updated 6 years ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc… ☆62 Updated 6 months ago
- Intel® Data Mover Library (Intel® DML) ☆93 Updated last month
- Unit benchmarks of CUDA event APIs. ☆17 Updated last year
- Allows safer access to model specific registers (MSRs) ☆91 Updated last month
- Programmatically obtain information about the pages backing a given memory region ☆76 Updated 3 years ago
- Generic Automatic Parallel Profiler ☆35 Updated 4 years ago
- A trivial Linux kernel module to execute WBINVD on demand ☆25 Updated last year
- C++ interfaces for RDMA access ☆77 Updated this week
- Quicksilver superpage management system ☆11 Updated 4 years ago
- Emulating DMA Engines on GPUs for Performance and Portability ☆40 Updated 10 years ago