rdolbeau / enable_arm_pmuLinks
Forked from https://github.com/thoughtpolice/enable_arm_pmu to enable user-mode access to ARMv8/Linux performance counters
☆25Updated last month
Alternatives and similar repositories for enable_arm_pmu
Users that are interested in enable_arm_pmu are comparing it to the libraries listed below
Sorting:
- Enable user-mode access to ARMv7/Linux performance counters☆80Updated 4 years ago
- Arm C Language Extensions (ACLE)☆110Updated last month
- Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs.☆102Updated 14 years ago
- ☆58Updated last month
- ROCm - AMDGPU Compute Application Binary Interface☆41Updated 3 years ago
- Collection of synchronization micro-benchmarks and traces from infrastructure applications☆45Updated last month
- PROGRESS64 is a C library of scalable functions for concurrent programs, primarily focused on networking applications.☆92Updated 3 months ago
- Microbenchmarks for Aarch64 (Cortex A53)☆12Updated 2 years ago
- ☆151Updated 3 weeks ago
- ☆16Updated 5 years ago
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload.☆123Updated last year
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆83Updated last year
- Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts☆216Updated 8 months ago
- Measures microarchitectural details such as ROB size. Like https://github.com/travisdowns/robsize but without runtime code generation, wh…☆129Updated 4 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆68Updated last year
- Intel® GPU Compute Samples☆108Updated last month
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆100Updated last year
- ROB size testing utility☆155Updated 3 years ago
- Trying to figure various CPU things out☆78Updated last year
- LLVM AMDGPU Assembler Helper Tools☆113Updated 8 years ago
- C for Media Runtime☆24Updated 2 years ago
- Information about AVX-512 support on recent Intel processors☆45Updated 3 years ago
- Instruction latency & throughput profiler for AArch64☆34Updated last week
- ☆133Updated last month
- Flexible GPGPU instrumentation☆88Updated 5 years ago
- Simple benchmark for memory throughput and latency☆384Updated 2 years ago
- Accelerated CRC32 for POWER8 using vpmsum instructions☆33Updated 5 years ago
- AArch64cryptolib is a from scratch implementation of cryptographic primitives aiming for optimal performance on Arm A-class cores☆39Updated last month
- CacheDirector - Sending Packets to the Right Slice by Exploiting Intel Last-Level Cache Addressing☆12Updated 6 years ago
- oneAPI Specification source files☆204Updated 3 weeks ago