Xuhpclab / DrCCTProf
DrCCTProf is a fine-grained call path profiling framework for binaries running on ARM and X86 architectures.
☆117Updated last year
Alternatives and similar repositories for DrCCTProf:
Users that are interested in DrCCTProf are comparing it to the libraries listed below
- ☆11Updated 3 years ago
- Java inefficiency detection tool based on CPU performance monitoring counters and hardware debug register. The tool detects dead writes, …☆45Updated 3 years ago
- ☆107Updated 4 years ago
- ☆97Updated 3 years ago
- ☆34Updated 2 years ago
- Official implementation of "MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training"☆37Updated 10 months ago
- GVProf: A Value Profiler for GPU-based Clusters☆48Updated 10 months ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆22Updated 3 months ago
- Official Implementation of "Accel-GNN: High-Performance GPU Accelerator Design for Graph Neural Networks"☆49Updated last year
- Build CUDA Neural Network From Scratch☆16Updated 5 months ago
- CERE: Codelet Extractor and REplayer☆40Updated last year
- MemLiner is a remote-memory-friendly runtime system.☆30Updated 2 years ago
- A Top-Down Profiler for GPU Applications☆14Updated 11 months ago
- ☆31Updated last year
- An unofficial mirror of the core PARSEC 3.0 benchmark suite with patches to run on x86_64 Arch Linux and generalize builds.☆102Updated 2 years ago
- Automatic virtualization of (general) accelerators.☆42Updated 2 years ago
- Race detector for NVIDIA GPUs, published in SOSP 2021.☆19Updated 7 months ago
- Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support…☆42Updated 6 years ago
- Tools to track memory accesses in applications and visualize the patterns to reveal opportunities for optimization.☆91Updated 9 years ago
- ☆35Updated 7 months ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.☆52Updated last year
- A CUDA compiler fuzzer☆24Updated last year
- Memory System Microbenchmarks☆62Updated last year
- DMon Prototype for OSDI 2021 Artifact Evaluation☆21Updated 3 years ago
- Source code for the FAST '23 paper “MadFS: Per-File Virtualization for Userspace Persistent Memory Filesystems”☆37Updated last year
- An LLVM pass to profile dynamic LLVM IR instructions and runtime values☆137Updated 4 years ago
- A compiler to automatically transform applications into disaggregated memory apps.☆15Updated last year
- Characterizing and Modeling Non-Volatile Memory Systems [MICRO'20, TopPicks'21]☆33Updated 3 years ago