thomwiggers / microbenchmark-aarch64Links
Microbenchmarks for Aarch64 (Cortex A53)
☆12Updated 2 years ago
Alternatives and similar repositories for microbenchmark-aarch64
Users that are interested in microbenchmark-aarch64 are comparing it to the libraries listed below
Sorting:
- ROB size testing utility☆158Updated 4 years ago
- Arm C Language Extensions (ACLE)☆117Updated last week
- Measures microarchitectural details such as ROB size. Like https://github.com/travisdowns/robsize but without runtime code generation, wh…☆132Updated 4 years ago
- Trying to figure various CPU things out☆90Updated 2 weeks ago
- Instruction latency & throughput profiler for AArch64☆40Updated 4 months ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆85Updated 2 years ago
- ☆59Updated 2 weeks ago
- Documentation of the RISC-V C API☆78Updated this week
- InstLatX64_Demo☆45Updated 2 months ago
- ☆154Updated last week
- RV: A Unified Region Vectorizer for LLVM☆112Updated 7 months ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆44Updated 4 years ago
- x86-64, ARM, and RVV intrinsics viewer☆76Updated last month
- A collection of RISC-V Vector (RVV) benchmarks to help developers write portably performant RVV code☆137Updated last month
- ROCm - AMDGPU Compute Application Binary Interface☆41Updated 3 years ago
- ☆17Updated 6 years ago
- Collection of synchronization micro-benchmarks and traces from infrastructure applications☆49Updated 5 months ago
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code☆29Updated 6 years ago
- ☆65Updated last year
- Conversions to MLIR EmitC☆134Updated last year
- assembler for NVIDIA FERMI. Imported from Google Code☆76Updated 10 years ago
- Simple benchmark for memory throughput and latency☆404Updated 2 years ago
- Intel® GPU Compute Samples☆109Updated 3 months ago
- Updated C version of the Test Suite for Vectorising Compilers☆70Updated last year
- ☆68Updated 6 years ago
- ☆143Updated 2 weeks ago
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload.☆129Updated last year
- Memory System Microbenchmarks☆65Updated 2 years ago
- Measure instruction latency and throughput☆28Updated 4 months ago
- ☆18Updated last year