thomwiggers / microbenchmark-aarch64
Microbenchmarks for Aarch64 (Cortex A53)
☆12 Updated 2 years ago
Alternatives and similar repositories for microbenchmark-aarch64
Users who are interested in microbenchmark-aarch64 compare it to the repositories listed below.
- ROB size testing utility ☆158 Updated 4 years ago
- Trying to figure various CPU things out ☆90 Updated this week
- Arm C Language Extensions (ACLE) ☆114 Updated 3 weeks ago
- Measures microarchitectural details such as ROB size. Like https://github.com/travisdowns/robsize but without runtime code generation, wh… ☆131 Updated 4 years ago
- ☆17 Updated 6 years ago
- Collection of synchronization micro-benchmarks and traces from infrastructure applications ☆49 Updated 4 months ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations ☆85 Updated 2 years ago
- RV: A Unified Region Vectorizer for LLVM ☆112 Updated 6 months ago
- ☆154 Updated last week
- Instruction latency & throughput profiler for AArch64 ☆40 Updated 4 months ago
- ROCm - AMDGPU Compute Application Binary Interface ☆41 Updated 3 years ago
- A collection of RISC-V Vector (RVV) benchmarks to help developers write portably performant RVV code ☆137 Updated 3 weeks ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator. ☆44 Updated 4 years ago
- ☆59 Updated this week
- The CLooG Code Generator in the Polyhedral Model ☆51 Updated 2 years ago
- Linux Cross-Memory Attach ☆97 Updated last year
- Memory System Microbenchmarks ☆65 Updated 2 years ago
- x86-64, ARM, and RVV intrinsics viewer ☆76 Updated 3 weeks ago
- Documentation of the RISC-V C API ☆78 Updated last week
- The University of Bristol HPC Simulation Engine ☆101 Updated 3 months ago
- ☆63 Updated last year
- Measure instruction latency and throughput ☆28 Updated 3 months ago
- World championship code for Graph500 ☆25 Updated last year
- Fast AVX512 (AVX-512) quicksort + bitonic sort. ☆28 Updated 3 years ago
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code ☆29 Updated 6 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources ☆122 Updated 8 months ago
- Updated C version of the Test Suite for Vectorising Compilers ☆70 Updated last year
- ☆68 Updated 6 years ago
- Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts ☆229 Updated last year
- SYCL Reference Manual ☆28 Updated last year