twest820 / AVX-512Links
AVX-512 documentation beyond what Intel provides
☆55Updated last year
Alternatives and similar repositories for AVX-512
Users that are interested in AVX-512 are comparing it to the libraries listed below
Sorting:
- InstLatX64_Demo☆44Updated last month
- CPU Ultimate Latency Test.☆111Updated last week
- Open Source Architecture Code Analyzer☆335Updated last week
- ROB size testing utility☆157Updated 3 years ago
- ☆58Updated this week
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆57Updated 2 years ago
- uops.info Code Analyzer☆290Updated last year
- A description of Minotaur can be found in https://arxiv.org/abs/2306.00229.☆111Updated 2 weeks ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆103Updated last year
- Instruction latency & throughput profiler for AArch64☆39Updated 3 weeks ago
- A fast implementation of log() and exp()☆52Updated 2 years ago
- Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts☆225Updated 10 months ago
- ☆58Updated last week
- ☆152Updated last week
- RV: A Unified Region Vectorizer for LLVM☆111Updated 3 months ago
- Copy of instlatx64.atw.hu☆223Updated last month
- Create man pages from information used by Intel Intrinsics Guide and optionally uops.info☆45Updated 9 months ago
- The future home for CnC Tests and Framework Libaries☆58Updated last month
- AOCL-LibM☆119Updated last week
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) APIs☆121Updated last month
- ☆33Updated last year
- A collection of (public) notes on assorted topics☆79Updated 3 weeks ago
- Measures microarchitectural details such as ROB size. Like https://github.com/travisdowns/robsize but without runtime code generation, wh…☆130Updated 4 years ago
- Trying to figure various CPU things out☆86Updated last year
- x86-64, ARM, and RVV intrinsics viewer☆56Updated 5 months ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆121Updated 2 years ago
- A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs.☆481Updated 3 months ago
- A Benchmark Toolkit for Assembly Instructions Using the LLVM JIT☆16Updated 4 years ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆84Updated last year
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated 2 years ago