ocxtal / insn_bench_aarch64
Instruction latency & throughput profiler for AArch64
☆37Updated last month
Alternatives and similar repositories for insn_bench_aarch64
Users who are interested in insn_bench_aarch64 are comparing it to the repositories listed below
- Apple Firestorm/Icestorm CPU microarchitecture docs☆241Updated 2 years ago
- ROB size testing utility☆156Updated 3 years ago
- ☆33Updated last year
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) APIs☆119Updated 2 weeks ago
- x86-64, ARM, and RVV intrinsics viewer☆55Updated 4 months ago
- Table of ARM SoC and their features☆58Updated last month
- RV: A Unified Region Vectorizer for LLVM☆111Updated 2 months ago
- Measures microarchitectural details such as ROB size. Like https://github.com/travisdowns/robsize but without runtime code generation, wh…☆129Updated 4 years ago
- A fast RISC-V emulator based on the RISC-V Sail model, and an experimental ARM one☆77Updated this week
- A minimal (really) out-of-tree MLIR example☆44Updated last week
- A description of Minotaur can be found in https://arxiv.org/abs/2306.00229.☆110Updated last year
- CPU Ultimate Latency Test.☆110Updated 2 months ago
- Embedded Universal DSL: a good DSL for us, by us☆44Updated this week
- Open Source Architecture Code Analyzer☆330Updated this week
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆40Updated 3 years ago
- ☆58Updated 3 weeks ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆103Updated last year
- Example implementation of Arm's Architecture Specification Language (ASL)☆119Updated 5 years ago
- The website for freeCompilerCamp's classroom tutorials, using GitHub Pages.☆32Updated 3 years ago
- uops.info Code Analyzer☆286Updated last year
- Trying to figure various CPU things out☆84Updated last year
- InstLatX64_Demo☆44Updated 2 weeks ago
- ☆292Updated 7 months ago
- Exploring the scalable matrix extension of the Apple M4 processor☆194Updated 9 months ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆84Updated last year
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code☆28Updated 6 years ago
- ☆40Updated 3 years ago
- ☆151Updated 3 weeks ago
- QEMU with support for CHERI☆59Updated last week
- Assembly super-optimization via constraint solving☆216Updated this week