intel / Immintrin-debug
immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable source level debug of vector code.
☆57Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Immintrin-debug
- InstLatX64_Demo☆41Updated 3 months ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆45Updated 4 years ago
- CPU Ultimate Latency Test.☆105Updated last year
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆116Updated 5 years ago
- benchmarking positional population count☆11Updated 7 months ago
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆128Updated last year
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆94Updated 6 months ago
- Reworking of Agner Fog's performance test programs for Linux☆110Updated 5 years ago
- AVX512 population count routines☆22Updated 5 years ago
- Programatically obtain information about the pages backing a given memory region☆73Updated 3 years ago
- Create man pages from information used by Intel Intrinsics Guide and optionally uops.info☆42Updated 3 years ago
- Information about AVX-512 support on recent Intel processors☆43Updated 2 years ago
- Support for ternary logic in SSE, XOP, AVX2 and x86 programs☆31Updated 3 years ago
- ☆53Updated last week
- C library to compute the Hamming weight of arrays☆44Updated 5 years ago
- Intriman is a documentation generator that retargets the Intel Intrinsics Guide to other documentation formats☆28Updated 2 years ago
- Mirror of Agner Fog's C++ vector class library☆30Updated 5 years ago
- AVX512F and AVX2 versions of quick sort☆105Updated 6 years ago
- Portable 128-bit SIMD intrinsics☆55Updated last year
- ☆15Updated 10 years ago
- SIMD recipes, for various platforms (collection of code snippets)☆48Updated 3 years ago
- Quick sort code using AVX2 instructions☆68Updated 7 years ago
- A binary optimization/specialization library supporting multiple rewriters☆112Updated 3 years ago
- Fastest CPU SIMD (SSE4) sorting networks for small integer arrays (2-6 elements), also optimal amd64 assembly and notes on getting compil…☆44Updated 3 years ago
- CERE: Codelet Extractor and REplayer☆41Updated last year
- Collection of Agner Fog Software☆36Updated 6 years ago
- Pruning elements in SIMD vectors (i.e., packing left elements)☆60Updated 9 months ago
- ☆68Updated 4 years ago
- Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)☆157Updated 7 months ago
- A fast implementation of log() and exp()☆49Updated last year