travisdowns / x86-loop-test
ASM methods to test small loop performance on x86
☆13 Updated 6 years ago
Alternatives and similar repositories for x86-loop-test
Users who are interested in x86-loop-test are comparing it to the repositories listed below.
- Performance Counter Measurements at the cycle granularity ☆18 Updated 4 years ago
- code for examining determinism of performance counters ☆21 Updated 4 years ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux. ☆106 Updated last year
- Extended Roofline Model - LLVM source tree with additional libraries for the analysis of the dynamic execution in the interpreter ☆17 Updated 8 years ago
- ☆40 Updated 3 years ago
- GPUVerify: a Verifier for GPU Kernels ☆74 Updated 3 years ago
- a Pin tool for collecting microarchitecture-independent workload characteristics ☆62 Updated last year
- A source-to-source compiler for automatic parallelization of C programs through code annotation. ☆61 Updated 5 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses. ☆46 Updated 6 years ago
- CERE: Codelet Extractor and REplayer ☆40 Updated 2 years ago
- Reworking of Agner Fog's performance test programs for Linux ☆116 Updated 2 months ago
- ☆16 Updated 6 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so… ☆59 Updated 3 years ago
- dthreads: Efficient Deterministic Multithreading ☆70 Updated 11 years ago
- Memory system characterization benchmarks using atomic operations ☆16 Updated 3 months ago
- ROB size testing utility ☆158 Updated 4 years ago
- A detailed microarchitectural x86 simulator ☆62 Updated 8 years ago
- Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs. ☆105 Updated 15 years ago
- SIMDized check which bytes are in a set ☆28 Updated 7 years ago
- Instruction THroughput Estimator using MAchine Learning (ITHEMAL) ☆152 Updated 4 years ago
- GNU Superoptimizer Version 2 ☆26 Updated 4 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆30 Updated last year
- Programmatically obtain information about the pages backing a given memory region ☆82 Updated 4 years ago
- Files used for the evaluation of uiCA ☆18 Updated 3 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU… ☆37 Updated 10 years ago
- Slice-aware Memory Management - Exploiting NUCA Characteristic of LLC in Intel Processors ☆41 Updated 6 years ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations ☆85 Updated 2 years ago
- RV: A Unified Region Vectorizer for LLVM ☆112 Updated 7 months ago
- ☆59 Updated 3 weeks ago
- ☆42 Updated 5 months ago