thomwiggers / microbenchmark-aarch64Links
Microbenchmarks for Aarch64 (Cortex A53)
☆12Updated 2 years ago
Alternatives and similar repositories for microbenchmark-aarch64
Users that are interested in microbenchmark-aarch64 are comparing it to the libraries listed below
Sorting:
- Arm C Language Extensions (ACLE)☆119Updated 2 weeks ago
- Trying to figure various CPU things out☆92Updated last week
- ROB size testing utility☆159Updated 4 years ago
- Measures microarchitectural details such as ROB size. Like https://github.com/travisdowns/robsize but without runtime code generation, wh…☆132Updated 4 years ago
- ROCm - AMDGPU Compute Application Binary Interface☆40Updated 3 years ago
- Instruction latency & throughput profiler for AArch64☆42Updated 5 months ago
- Collection of synchronization micro-benchmarks and traces from infrastructure applications☆50Updated 6 months ago
- Forked from https://github.com/thoughtpolice/enable_arm_pmu to enable user-mode access to ARMv8/Linux performance counters☆25Updated 8 months ago
- ☆154Updated 2 weeks ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆85Updated 2 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆44Updated 4 years ago
- Machine-readable data describing Arm architecture and implementations. Includes JSON descriptions of implemented PMU events.☆59Updated last year
- x86-64, ARM, and RVV intrinsics viewer☆76Updated 2 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆148Updated 2 weeks ago
- The SHOC Benchmark Suite☆260Updated 3 months ago
- Simple benchmark for memory throughput and latency☆408Updated 2 years ago
- Memory System Microbenchmarks☆65Updated 2 years ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆127Updated 3 years ago
- Intel® GPU Compute Samples☆109Updated 4 months ago
- RV: A Unified Region Vectorizer for LLVM☆113Updated 8 months ago
- ☆59Updated last month
- ☆17Updated 6 years ago
- C for Media Runtime☆24Updated 3 years ago
- A collection of RISC-V Vector (RVV) benchmarks to help developers write portably performant RVV code☆141Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆138Updated 2 years ago
- ☆68Updated 6 years ago
- Measure instruction latency and throughput☆29Updated 5 months ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆125Updated 9 months ago
- InstLatX64_Demo☆45Updated 3 months ago
- a clone of POCL that includes RISC-V newlib devices support and Vortex☆49Updated 2 weeks ago