berenger-eu / avx-512-sort
Fast AVX512 (AVX-512) quicksort + bitonic sort.
☆28 · Updated 2 years ago
Alternatives and similar repositories for avx-512-sort:
Users who are interested in avx-512-sort are comparing it to the libraries listed below:
- InstLatX64_Demo ☆43 · Updated last week
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so… ☆57 · Updated 2 years ago
- Generic Automatic Parallel Profiler ☆35 · Updated 4 years ago
- Code used for generating charts and measurements of nontemporal stores ☆9 · Updated 6 years ago
- ☆55 · Updated 2 years ago
- CERE: Codelet Extractor and REplayer ☆40 · Updated last year
- AVX512F and AVX2 versions of quick sort ☆105 · Updated 7 years ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux. ☆96 · Updated 11 months ago
- Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages. ☆53 · Updated last year
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) API ☆103 · Updated last month
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc… ☆59 · Updated 6 months ago
- ☆36 · Updated 9 months ago
- Predator: Predictive False Sharing Detection ☆21 · Updated 10 years ago
- A Scalable, Portable, and Memory-Efficient Lock-Free FIFO Queue (DISC '19) ☆55 · Updated last year
- Persistent Memory Test Suite ☆13 · Updated 4 years ago
- User-space Page Management ☆107 · Updated 8 months ago
- A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs. ☆16 · Updated 2 years ago
- A Benchmark Toolkit for Assembly Instructions Using the LLVM JIT ☆16 · Updated 4 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK. ☆38 · Updated 3 years ago
- Cross-platform benchmarking for memory allocators, aiming to be as close to real world as it is practical ☆45 · Updated 6 years ago
- Intel® Query Processing Library (Intel® QPL) ☆103 · Updated 2 weeks ago
- CCProf: Lightweight Detection of Cache Conflicts ☆26 · Updated 4 years ago
- ☆56 · Updated last month
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication ☆14 · Updated 3 years ago
- Montage is a system for building fast buffered persistent data structures on nonvolatile memory. ☆15 · Updated 2 years ago
- Testing memory-level parallelism ☆68 · Updated last year
- Haystack is an analytical cache model that given a program computes the number of cache misses. ☆46 · Updated 5 years ago
- A benchmark for hash tables and hash functions in C++, evaluate on different data as comprehensively as possible ☆19 · Updated last month
- Tools and Reference Code for Intel Optimizations (eg Large Pages) ☆140 · Updated 7 months ago
- C++ interfaces for RDMA access ☆72 · Updated last month