travisdowns / x86-loop-test
ASM methods to test small loop performance on x86
☆13Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for x86-loop-test
- Performance Counter Measurements at the cycle granularity☆18Updated 3 years ago
- code for examining determinism of performance counters☆21Updated 3 years ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆94Updated 6 months ago
- Extended Roofline Model - LLVM source tree with additional libraries for the analysis of the dynamic execution in the interpreter☆17Updated 7 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆42Updated 5 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆57Updated last year
- Intel Heterogeneous Research Compiler (iHRC)☆25Updated last year
- Benchmark for memory store throughput☆23Updated 3 years ago
- CERE: Codelet Extractor and REplayer☆41Updated last year
- ☆35Updated 2 years ago
- CCProf: Lightweight Detection of Cache Conflicts☆25Updated 3 years ago
- memTrace, a framework for lightweight memory tracing☆55Updated 4 years ago
- ☆35Updated this week
- ☆15Updated last year
- Multiplication using AVX512 and AVX512IFMA instructions☆23Updated 9 years ago
- A Benchmark Toolkit for Assembly Instructions Using the LLVM JIT☆16Updated 4 years ago
- Collection of Agner Fog Software☆36Updated 6 years ago
- benchmarking positional population count☆12Updated 8 months ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- Code for experiments referenced in the Usenix Security 2017 paper "Strong and Efficient Cache Side-Channel Protection using Hardware Tran…☆13Updated 2 years ago
- Repeated access to L2-containable loops to look for snoop filter conflicts on Intel Skylake Xeon processors.☆29Updated 6 years ago
- GNU Superoptimizer Version 2☆25Updated 3 years ago
- Fine-grained frequency and voltage transition tests☆19Updated last year
- Mallacc: Accelerating Memory Allocation☆13Updated 6 years ago
- a Pin tool for collecting microarchitecture-independent workload characteristics☆59Updated 9 months ago
- library which simplifies host-GPU data transfer using userspace pagefault handling☆15Updated 12 years ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆31Updated 2 years ago
- A detailed michroarchitectural x86 simulator☆61Updated 7 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆27Updated 2 months ago