twest820 / AVX-512
AVX-512 documentation beyond what Intel provides
☆44Updated last year
Alternatives and similar repositories for AVX-512:
Users who are interested in AVX-512 are comparing it to the repositories listed below:
- InstLatX64_Demo☆41Updated this week
- ROB size testing utility☆140Updated 3 years ago
- Instruction latency & throughput profiler for AArch64☆32Updated 11 months ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆57Updated 2 years ago
- RV: A Unified Region Vectorizer for LLVM☆107Updated 3 months ago
- A description of Minotaur can be found in https://arxiv.org/abs/2306.00229.☆98Updated 5 months ago
- Open Source Architecture Code Analyzer☆309Updated last week
- x86-64, ARM, and RVV intrinsics viewer☆36Updated 3 weeks ago
- A minimal (really) out-of-tree MLIR example☆36Updated 2 weeks ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆81Updated last year
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆97Updated 8 months ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆56Updated 2 months ago
- Performance Counter Measurements at the cycle granularity☆18Updated 3 years ago
- uops.info Code Analyzer☆250Updated last year
- Programmatically obtain information about the pages backing a given memory region☆74Updated 3 years ago
- Create man pages from information used by Intel Intrinsics Guide and optionally uops.info☆45Updated last month
- CPU Ultimate Latency Test.☆106Updated last year
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆109Updated 2 years ago
- Simple demonstration of using the RISC-V Vector extension☆38Updated 9 months ago
- Info on enabling AVX-512 on Alder Lake☆39Updated 2 years ago
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆120Updated 5 years ago
- Information about AVX-512 support on recent Intel processors☆43Updated 2 years ago
- Trying to figure various CPU things out☆73Updated 11 months ago
- Record "perf" performance metrics for individual functions/regions of an ELF binary.☆74Updated last year
- Fork of LLVM for demonstrating optimization pass development☆29Updated last year
- The new home for CnC Tests and Framework Libraries☆52Updated last month
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated 2 years ago