Xuhpclab / DrCCTProf
DrCCTProf is a fine-grained call path profiling framework for binaries running on ARM and X86 architectures.
☆118 · Updated last year
Alternatives and similar repositories for DrCCTProf:
Users who are interested in DrCCTProf are comparing it to the repositories listed below:
- ☆11 · Updated 3 years ago
- ☆34 · Updated 3 years ago
- Java inefficiency detection tool based on CPU performance monitoring counters and hardware debug register. The tool detects dead writes, … ☆45 · Updated 3 years ago
- ☆97 · Updated 3 years ago
- ☆107 · Updated 4 years ago
- GVProf: A Value Profiler for GPU-based Clusters ☆49 · Updated 11 months ago
- Official implementation of "MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training" ☆37 · Updated last year
- ☆31 · Updated last year
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK. ☆38 · Updated 3 years ago
- DMon Prototype for OSDI 2021 Artifact Evaluation ☆22 · Updated 3 years ago
- MemLiner is a remote-memory-friendly runtime system. ☆30 · Updated 2 years ago
- CUDAAdvisor: a GPU profiling tool ☆48 · Updated 6 years ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems. ☆31 · Updated 2 years ago
- Official Implementation of "Accel-GNN: High-Performance GPU Accelerator Design for Graph Neural Networks" ☆49 · Updated last year
- Light-weight Performance Variance Detection for Production-run Parallel Applications ☆12 · Updated last year
- ☆15 · Updated 7 months ago
- CERE: Codelet Extractor and REplayer ☆40 · Updated last year
- AST interpreter with clang 5.0.0 and llvm 5.0.0 ☆14 · Updated 5 years ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications. ☆25 · Updated 4 months ago
- Graspan-G is a GPU-based version of Graspan. ☆9 · Updated 3 years ago
- Intel® Data Mover Library (Intel® DML) ☆92 · Updated 5 months ago
- Race detector for NVIDIA GPUs, published in SOSP 2021. ☆19 · Updated last week
- HQEMU v2.5.1 is a retargetable and multi-threaded dynamic binary translator on multicores ☆21 · Updated 6 years ago
- A collection of code based on LLVM/Clang compilation libraries and tools ☆39 · Updated 5 years ago
- Tools to track memory accesses in applications and visualize the patterns to reveal opportunities for optimization. ☆92 · Updated 9 years ago
- The ultimate memory bandwidth benchmark ☆47 · Updated 3 weeks ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21. ☆27 · Updated 3 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche… ☆91 · Updated 2 years ago
- Automatic virtualization of (general) accelerators. ☆42 · Updated 2 years ago
- EDC20: Code repository for the auto_navigation_car based on stm32. Contributed by the team A_star (champion team of the 20th Tsinghua Univ… ☆21 · Updated 5 years ago