NMSU-PEARL / GPUs-EnergyLinks
[CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs
☆15Updated 5 years ago
Alternatives and similar repositories for GPUs-Energy
Users that are interested in GPUs-Energy are comparing it to the libraries listed below
Sorting:
- A Data-Centric Compiler for Machine Learning☆85Updated last year
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆27Updated last year
- GPU Performance Advisor☆65Updated 3 years ago
- Performance Prediction Toolkit☆54Updated 3 months ago
- An IR for efficiently simulating distributed ML computation.☆31Updated last year
- ☆20Updated 6 years ago
- A GPU performance prediction toolkit for CUDA programs☆18Updated 6 years ago
- GEMM and Winograd based convolutions using CUTLASS☆28Updated 5 years ago
- COCCL: Compression and precision co-aware collective communication library☆29Updated 8 months ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆67Updated 7 years ago
- ParaDnn: A systematic performance analysis methodology for deep learning.☆40Updated 5 years ago
- A lightweight, Pythonic, frontend for MLIR☆80Updated 2 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆137Updated 3 years ago
- An HPL-AI implementation for Fugaku☆22Updated 4 years ago
- ☆48Updated 5 years ago
- ☆11Updated 4 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆140Updated 2 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated 2 years ago
- ☆41Updated 2 months ago
- Tutorials for NVIDIA CUPTI samples☆42Updated last month
- GVProf: A Value Profiler for GPU-based Clusters☆52Updated last year
- 🎃 GPU load-balancing library for regular and irregular computations.☆63Updated 3 months ago
- 🔮 Execution time predictions for deep neural network training iterations across different GPUs.☆63Updated 3 years ago
- ☆17Updated 4 years ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/…☆22Updated 2 years ago
- Slides and exercises for persistent memory programming tutorial☆14Updated 3 years ago
- ☆28Updated 2 weeks ago
- ☆288Updated 2 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆47Updated 3 months ago
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆26Updated 2 years ago