twest820 / AVX-512
AVX-512 documentation beyond what Intel provides
☆46Updated last year
Alternatives and similar repositories for AVX-512:
Users interested in AVX-512 are comparing it to the repositories listed below
- InstLatX64_Demo☆41Updated last month
- ROB size testing utility☆142Updated 3 years ago
- The new home for CnC Tests and Framework Libraries☆54Updated 2 months ago
- Open Source Architecture Code Analyzer☆311Updated 3 weeks ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆98Updated 9 months ago
- uops.info Code Analyzer☆256Updated last year
- Trying to figure various CPU things out☆74Updated last year
- RV: A Unified Region Vectorizer for LLVM☆107Updated 3 weeks ago
- Instruction latency & throughput profiler for AArch64☆32Updated last year
- A description of Minotaur can be found in https://arxiv.org/abs/2306.00229.☆100Updated 6 months ago
- A minimal (really) out-of-tree MLIR example☆37Updated last month
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆57Updated 2 years ago
- ☆56Updated this week
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆56Updated 3 months ago
- x86-64, ARM, and RVV intrinsics viewer☆42Updated this week
- ☆28Updated 8 months ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆113Updated 2 years ago
- CPU Ultimate Latency Test.☆107Updated last year
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated 2 years ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆81Updated last year
- Little OpenMP Library☆157Updated 2 years ago
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more☆129Updated this week
- Programmatically obtain information about the pages backing a given memory region☆74Updated 3 years ago
- Companion Repository for the Lecture Slides for the Clang Libraries☆87Updated 11 months ago
- ☆55Updated 5 months ago
- A Benchmark Toolkit for Assembly Instructions Using the LLVM JIT☆16Updated 4 years ago
- Generates CIL MLIR dialect from C/C++ source.☆32Updated 4 years ago
- Trying to figure various CPU things out☆113Updated last week
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) API☆96Updated last week
- Record "perf" performance metrics for individual functions/regions of an ELF binary.☆77Updated last year