travisdowns / x86-loop-testLinks
ASM methods to test small loop performance on x86
☆13Updated 6 years ago
Alternatives and similar repositories for x86-loop-test
Users that are interested in x86-loop-test are comparing it to the libraries listed below
Sorting:
- Performance Counter Measurements at the cycle granularity☆18Updated 4 years ago
- Extended Roofline Model - LLVM source tree with additional libraries for the analysis of the dynamic execution in the interpreter☆17Updated 8 years ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆103Updated last year
- code for examining determinism of performance counters☆21Updated 4 years ago
- ☆40Updated 3 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆58Updated 2 years ago
- Reworking of Agner Fog's performance test programs for Linux☆114Updated 6 years ago
- Memory system characterization benchmarks using atomic operations☆15Updated last year
- CERE: Codelet Extractor and REplayer☆40Updated 2 years ago
- A source-to-source compiler for automatic parallelization of C programs through code annotation.☆62Updated 5 years ago
- a Pin tool for collecting microarchitecture-independent workload characteristics☆60Updated last year
- GPUVerify: a Verifier for GPU Kernels☆67Updated 3 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 6 years ago
- Programatically obtain information about the pages backing a given memory region☆79Updated 3 years ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆84Updated last year
- Intel Heterogeneous Research Compiler (iHRC)☆25Updated 2 years ago
- CUDAAdvisor: a GPU profiling tool☆50Updated 7 years ago
- A detailed michroarchitectural x86 simulator☆62Updated 8 years ago
- Slice-aware Memory Management - Exploiting NUCA Characteristic of LLC in Intel Processors☆41Updated 6 years ago
- ☆16Updated 6 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆35Updated 9 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated last year
- ☆74Updated 2 years ago
- Stable, non-KVM version of PTLsim.☆29Updated 9 years ago
- ☆58Updated 2 weeks ago
- ROB size testing utility☆157Updated 3 years ago
- Tapir extension to LLVM for optimizing Parallel Programs☆135Updated 5 years ago
- Repeated access to L2-containable loops to look for snoop filter conflicts on Intel Skylake Xeon processors.☆29Updated 7 years ago
- A survey on architectural simulators focused on CPU caches.☆16Updated 5 years ago
- Evaluating different memory managers for dynamic GPU memory☆26Updated 4 years ago