berenger-eu / avx-512-sort
Fast AVX512 (AVX-512) quicksort + bitonic sort.
☆28 · Updated 3 years ago
Alternatives and similar repositories for avx-512-sort
Users that are interested in avx-512-sort are comparing it to the libraries listed below
- InstLatX64_Demo ☆43 · Updated 3 weeks ago
- ☆36 · Updated 11 months ago
- AVX512F and AVX2 versions of quick sort ☆104 · Updated 7 years ago
- User-space Page Management ☆107 · Updated 10 months ago
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) APIs ☆115 · Updated 3 weeks ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc… ☆64 · Updated 7 months ago
- Programmatically obtain information about the pages backing a given memory region ☆77 · Updated 3 years ago
- Tools and Reference Code for Intel Optimizations (eg Large Pages) ☆143 · Updated 9 months ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so… ☆56 · Updated 2 years ago
- Testing memory-level parallelism ☆68 · Updated last year
- Code used for generating charts and measurements of nontemporal stores ☆9 · Updated 6 years ago
- A Scalable, Portable, and Memory-Efficient Lock-Free FIFO Queue (DISC '19) ☆58 · Updated last year
- A small library and kernel module for easy access to x86 performance monitor counters under Linux. ☆99 · Updated last year
- A description of Minotaur can be found in https://arxiv.org/abs/2306.00229. ☆108 · Updated 10 months ago
- ☆58 · Updated 2 weeks ago
- Very low-overhead timer/counter interfaces for C on Intel 64 processors. ☆134 · Updated 5 years ago
- Repo for OSDI 2023 paper: "Ship your Critical Section Not Your Data: Enabling Transparent Delegation with TCLocks" ☆16 · Updated 7 months ago
- Quick sort code using AVX2 instructions ☆69 · Updated 8 years ago
- Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages. ☆52 · Updated last week
- Intel® Query Processing Library (Intel® QPL) ☆103 · Updated this week
- Collection of synchronization micro-benchmarks and traces from infrastructure applications ☆44 · Updated last week
- ☆55 · Updated 6 years ago
- Source code for the FAST '23 paper “MadFS: Per-File Virtualization for Userspace Persistent Memory Filesystems” ☆41 · Updated 2 years ago
- Generic Automatic Parallel Profiler ☆35 · Updated 4 years ago
- ☆30 · Updated 3 years ago
- Sample program for article "SIMD-ized searching in unique constant dictionary" (http://0x80.pl/articles/simd-search.html) ☆52 · Updated 8 years ago
- Intel® Data Mover Library (Intel® DML) ☆95 · Updated 2 months ago
- Benchmark Intel TSX (Transactional Synchronization Extension) Hardware Transactional Memory on my sandbox ☆24 · Updated 11 years ago
- ✈️ PTHash is a fast and compact minimal perfect hash function. ☆229 · Updated 3 weeks ago
- CERE: Codelet Extractor and REplayer ☆40 · Updated last year