NMSU-PEARL / GPUs-Energy
[CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs
☆15Updated 4 years ago
Alternatives and similar repositories for GPUs-Energy:
Users that are interested in GPUs-Energy are comparing it to the libraries listed below
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated 4 months ago
- Multi-GPU communication profiler and visualizer☆26Updated 8 months ago
- GPU Performance Advisor☆64Updated 2 years ago
- Performance Prediction Toolkit☆51Updated 2 months ago
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆14Updated 4 years ago
- ☆23Updated 5 years ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆64Updated 6 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 5 years ago
- A GPU performance prediction toolkit for CUDA programs☆16Updated 5 years ago
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆16Updated 3 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆19Updated 3 weeks ago
- ☆24Updated last year
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆32Updated 3 years ago
- An Attention Superoptimizer☆21Updated last month
- ☆16Updated 2 years ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆59Updated this week
- CUDA Templates for Linear Algebra Subroutines☆14Updated this week
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- Optimize tensor program fast with Felix, a gradient descent autotuner.☆24Updated 10 months ago
- Near-storage compute aware file system and FPGA operator pipelines.☆29Updated 3 years ago
- An IR for efficiently simulating distributed ML computation.☆28Updated last year
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆22Updated last year
- GVProf: A Value Profiler for GPU-based Clusters☆49Updated 11 months ago
- ☆30Updated 2 years ago
- 🔮 Execution time predictions for deep neural network training iterations across different GPUs.☆60Updated 2 years ago
- ☆40Updated this week
- Slides and exercises for persistent memory programming tutorial☆12Updated 2 years ago
- Code for Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture (accepted by PVLDB).The outdated wr…☆9Updated last year
- ETHZ Heterogeneous Accelerated Compute Cluster.☆31Updated this week
- ☆11Updated 9 months ago