zingaburga / alderlake_avx512
Info on enabling AVX-512 on Alder Lake
☆39Updated 2 years ago
Related projects: ⓘ
- InstLatX64_Demo☆41Updated last month
- AVX-512 documentation beyond what Intel provides☆38Updated 9 months ago
- ☆50Updated last week
- Microbenchmarking experiments on Zen 2 machines☆15Updated 2 years ago
- A software library of lossless data compression methods tuned and optimized for AMD “Zen”-based CPUs☆22Updated 4 months ago
- Trying to figure various CPU things out☆62Updated last week
- ☆23Updated 3 months ago
- A Metal implementation similar to the official Metal C++ API☆39Updated last year
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) API☆85Updated last week
- Instruction latency & throughput profiler for AArch64☆31Updated 7 months ago
- Micro benchmarks CPU/GPU☆47Updated 2 years ago
- Utilities for accessing AMD's Machine-Readable GPU ISA Specifications.☆15Updated 3 weeks ago
- ☆28Updated 4 months ago
- A simple benchmark which measures latency between CPU cores.☆39Updated 7 years ago
- GPGMM, a General-Purpose GPU Memory Management Library.☆32Updated 7 months ago
- Derived from Nemes' gpuperftests☆24Updated 2 months ago
- Performance Counter Measurements at the cycle granularity☆17Updated 3 years ago
- A description of Minotaur can be found in https://arxiv.org/abs/2306.00229.☆86Updated last month
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆57Updated last year
- OpenCL/SPIR-V implementation of HIP☆104Updated last year
- A runtime SPIR-V assembler☆40Updated last year
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆91Updated 4 months ago
- A Benchmark Toolkit for Assembly Instructions Using the LLVM JIT☆16Updated 3 years ago
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆185Updated this week
- Test if AVX vector loads and stores are atomic☆22Updated 4 years ago
- An attempt to make a more accessible microbenchmark☆118Updated 3 months ago
- Fine-grained frequency and voltage transition tests☆19Updated last year
- Marek's approach to building AMD GPU drivers for driver development☆18Updated this week
- ZP7: Zach's Peppy Parallel-Prefix-Popcountin' PEXT/PDEP Polyfill☆43Updated last month
- A cross-platform (Windows and Linux) CPU memory latency benchmark.☆45Updated 2 years ago