twest820 / AVX-512Links
AVX-512 documentation beyond what Intel provides
☆59Updated 2 years ago
Alternatives and similar repositories for AVX-512
Users that are interested in AVX-512 are comparing it to the libraries listed below
Sorting:
- InstLatX64_Demo☆45Updated last month
- CPU Ultimate Latency Test.☆116Updated 3 months ago
- uops.info Code Analyzer☆306Updated last year
- x86-64, ARM, and RVV intrinsics viewer☆76Updated 3 weeks ago
- ROB size testing utility☆159Updated 3 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆59Updated 2 years ago
- Open Source Architecture Code Analyzer☆342Updated last week
- A fast implementation of log() and exp()☆53Updated 3 years ago
- A description of Minotaur can be found in https://arxiv.org/abs/2306.00229.☆119Updated 3 months ago
- Instruction latency & throughput profiler for AArch64☆40Updated 3 months ago
- ☆57Updated 3 months ago
- RV: A Unified Region Vectorizer for LLVM☆112Updated 6 months ago
- ☆59Updated last week
- Support for ternary logic in SSE, XOP, AVX2 and x86 programs☆31Updated 11 months ago
- A minimal (really) out-of-tree MLIR example☆46Updated 4 months ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆126Updated 2 years ago
- Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts☆229Updated last year
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated 3 years ago
- C++ template library for floating point operations☆36Updated last week
- Record "perf" performance metrics for individual functions/regions of an ELF binary.☆81Updated last year
- AOCL-LibM☆123Updated last month
- ☆50Updated this week
- Trying to figure various CPU things out☆90Updated last year
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆106Updated last year
- Create man pages from information used by Intel Intrinsics Guide and optionally uops.info☆45Updated last year
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆138Updated last month
- A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs.☆498Updated 3 weeks ago
- A performant, parallel, probabilistic, random acyclic-graph, low-latency, perfect hash generation library.☆86Updated 6 months ago
- Reworking of Agner Fog's performance test programs for Linux☆115Updated last month
- Copy of instlatx64.atw.hu☆229Updated 2 weeks ago