thomwiggers / microbenchmark-aarch64
Microbenchmarks for AArch64 (Cortex-A53)
☆12 Updated 2 years ago
Alternatives and similar repositories for microbenchmark-aarch64
Users who are interested in microbenchmark-aarch64 are comparing it to the repositories listed below.
- ROB size testing utility ☆158 Updated 3 years ago
- Trying to figure various CPU things out ☆87 Updated last year
- ☆154 Updated this week
- Measures microarchitectural details such as ROB size. Like https://github.com/travisdowns/robsize but without runtime code generation, wh… ☆132 Updated 4 years ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations ☆85 Updated 2 years ago
- Instruction latency & throughput profiler for AArch64 ☆39 Updated 2 months ago
- Arm C Language Extensions (ACLE) ☆115 Updated last week
- ROCm - AMDGPU Compute Application Binary Interface ☆41 Updated 3 years ago
- InstLatX64_Demo ☆44 Updated 2 weeks ago
- Collection of synchronization micro-benchmarks and traces from infrastructure applications ☆48 Updated 3 months ago
- ☆59 Updated this week
- x86-64, ARM, and RVV intrinsics viewer ☆66 Updated last month
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator. ☆43 Updated 4 years ago
- ☆17 Updated 5 years ago
- Documentation of the RISC-V C API ☆77 Updated this week
- Memory System Microbenchmarks ☆64 Updated 2 years ago
- A collection of RISC-V Vector (RVV) benchmarks to help developers write portably performant RVV code ☆133 Updated last month
- C for Media Runtime ☆24 Updated 3 years ago
- RV: A Unified Region Vectorizer for LLVM ☆112 Updated 5 months ago
- Stable, non-KVM version of PTLsim. ☆29 Updated 9 years ago
- Measure instruction latency and throughput ☆25 Updated 2 months ago
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload. ☆126 Updated last year
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code ☆30 Updated 6 years ago
- Fast AVX512 (AVX-512) quicksort + bitonic sort. ☆28 Updated 3 years ago
- Open Source Architecture Code Analyzer ☆335 Updated last month
- Intel® GPU Compute Samples ☆109 Updated last month
- Develop an LLVM-based toolchain for the Cpu0 processor ☆49 Updated 3 weeks ago
- Assembler for NVIDIA Fermi. Imported from Google Code ☆73 Updated 10 years ago
- ☆54 Updated 5 years ago
- Simple benchmark for memory throughput and latency ☆398 Updated 2 years ago