VictorRodriguez / autofdo_tutorial
AutoFDO tutorial
☆21 Updated 6 years ago
Related projects
Alternatives and complementary repositories for autofdo_tutorial
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more ☆117 Updated this week
- Tools and Reference Code for Intel Optimizations (e.g., Large Pages) ☆136 Updated 2 months ago
- Collection of synchronization micro-benchmarks and traces from infrastructure applications ☆38 Updated 6 months ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc… ☆55 Updated 3 weeks ago
- ☆60 Updated 4 months ago
- PMU event analysis package ☆75 Updated last year
- An unofficial mirror of the core PARSEC 3.0 benchmark suite with patches to run on x86_64 Arch Linux and generalize builds. ☆99 Updated 2 years ago
- ☆113 Updated 3 months ago
- Repeated access to L2-containable loops to look for snoop filter conflicts on Intel Skylake Xeon processors. ☆29 Updated 6 years ago
- Machine-readable data describing Arm architecture and implementations. Includes JSON descriptions of implemented PMU events. ☆40 Updated 7 months ago
- ☆34 Updated 2 years ago
- libperf is a library that wraps around the syscall perf_event_open(). This library exposes the kernel performance counters subsystem to … ☆54 Updated 3 years ago
- Automatically generated litmus tests for validating LISA-language Linux-kernel memory models ☆21 Updated last month
- Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks ☆84 Updated 3 years ago
- NUMAPROF is a NUMA memory profiler based on Pintool to track your remote memory accesses. ☆45 Updated 4 months ago
- Enable user-mode access to ARMv7/Linux performance counters ☆42 Updated 8 years ago
- Memory System Microbenchmarks ☆61 Updated last year
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems. ☆31 Updated 2 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆27 Updated 2 months ago
- DRAM Bank-Aware Kernel Memory Allocator ☆42 Updated 6 months ago
- ARMv8 performance monitor from userspace ☆71 Updated last year
- Pointer-chasing memory benchmark (forked from Doug Pase's code). ☆58 Updated 10 years ago
- User-space Page Management ☆104 Updated 3 months ago
- Blog and pages generated by Jekyll. Hosted on GitHub. ☆54 Updated this week
- Generic x86_64 PCIe latency measurement module for the Linux kernel ☆56 Updated 3 years ago
- ☆242 Updated this week
- ROB size testing utility ☆134 Updated 2 years ago
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code ☆25 Updated 5 years ago
- A tool for measuring the cache-coherence latencies of a processor (i.e., the latencies of loads, stores, CAS, FAI, TAS, and SWAP). ☆75 Updated 2 years ago
- ☆20 Updated 2 years ago