Power measurement for CUDA programs by polling the NVIDIA Management Library (NVML) APIs.
☆26Jun 24, 2017Updated 8 years ago
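The polling approach named in the description can be sketched as follows. This is a minimal illustration, not the nvml-power implementation: a background thread samples a power reading at a fixed interval, and energy is estimated by trapezoidal integration over the samples. `PowerPoller`, `trapezoid_energy_j`, and `nvml_reader` are hypothetical names introduced here. The reader assumes the `pynvml` bindings are installed, where `nvmlDeviceGetPowerUsage` reports milliwatts.

```python
import threading
import time


def trapezoid_energy_j(samples):
    """Integrate (timestamp_s, power_mw) samples into joules (trapezoid rule)."""
    total_mj = 0.0
    for (t0, p0), (t1, p1) in zip(samples, samples[1:]):
        total_mj += 0.5 * (p0 + p1) * (t1 - t0)  # mW * s = mJ
    return total_mj / 1000.0


class PowerPoller:
    """Hypothetical helper: poll a power reader on a background thread."""

    def __init__(self, read_power_mw, interval_s=0.01):
        self._read = read_power_mw      # callable returning power in milliwatts
        self._interval = interval_s     # polling period in seconds
        self._stop = threading.Event()
        self._thread = None
        self.samples = []               # list of (timestamp_s, power_mw)

    def _run(self):
        while True:
            self.samples.append((time.perf_counter(), self._read()))
            # wait() returns True once stop is requested, ending the loop
            if self._stop.wait(self._interval):
                break
        # take a final sample so the integration covers the whole region
        self.samples.append((time.perf_counter(), self._read()))

    def __enter__(self):
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()
        return self

    def __exit__(self, exc_type, exc, tb):
        self._stop.set()
        self._thread.join()
        return False

    def energy_joules(self):
        return trapezoid_energy_j(self.samples)


def nvml_reader(device_index=0):
    """Assumed reader built on pynvml; imported lazily so the sketch loads without it."""
    import pynvml
    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(device_index)
    return lambda: pynvml.nvmlDeviceGetPowerUsage(handle)  # milliwatts
```

On a machine with an NVIDIA GPU, the code to be measured would run inside `with PowerPoller(nvml_reader(0)) as poller:`, and `poller.energy_joules()` would be read afterwards. The reported power is itself refreshed by the driver at a limited rate, so a very short polling interval may simply return repeated values.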
Alternatives and similar repositories for nvml-power
Users who are interested in nvml-power are comparing it to the repositories listed below
- An LLVM pass for counting global uncoalesced accesses in CUDA code via dynamic analysis.☆14Nov 17, 2018Updated 7 years ago
- Fast binary matrix product on CPU☆10Feb 11, 2016Updated 10 years ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Dec 11, 2020Updated 5 years ago
- A demo project showing the performance improvement from a C++ extension wrapped with pybind11.☆10Nov 16, 2021Updated 4 years ago
- Repository of the team 「賞金で二郎一生分食べたい!」 ("We want to eat a lifetime's worth of Jiro with the prize money!").☆11Dec 9, 2021Updated 4 years ago
- ACT: An Architectural Carbon Modeling Tool for Designing Sustainable Computer Systems☆45Jul 21, 2025Updated 7 months ago
- Makes the DX Library (DXライブラリ) usable from Rust☆10Mar 7, 2021Updated 5 years ago
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago
- CPU and GPU tutorial examples☆13Apr 4, 2025Updated 11 months ago
- CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:☆11Jan 16, 2019Updated 7 years ago
- ☆11Jun 9, 2023Updated 2 years ago
- ReactiveX for Objective-C☆14Dec 30, 2020Updated 5 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- rice: An ANSI C implementation of Rice coding (Golomb-Rice coding)☆15Oct 23, 2019Updated 6 years ago
- Logger for MPI communication☆27Jul 12, 2023Updated 2 years ago
- A tool for checking tool output inspired by LLVM's FileCheck☆13Aug 29, 2025Updated 6 months ago
- DATS JSON schemas☆13Dec 21, 2022Updated 3 years ago
- Fast GPU error-bounded lossy compressor for floating-point data.☆56Mar 10, 2026Updated last week
- Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)☆14Feb 14, 2020Updated 6 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆41Mar 17, 2024Updated 2 years ago
- SParse AcceleRation on Tensor Architecture☆18Apr 7, 2025Updated 11 months ago
- Validated Collective Knowledge workflows and results from the 1st ACM ReQuEST tournament on co-design of Pareto-efficient SW/HW stack for…☆13Oct 16, 2018Updated 7 years ago
- This package provides a ROS2 driver node for HOKUYO 3D LiDAR (SOKUIKI Sensor).☆13Dec 11, 2025Updated 3 months ago
- A set of cog recipes for C++ reflection☆15Aug 21, 2011Updated 14 years ago
- ☆25Nov 20, 2025Updated 4 months ago
- Unstructured computations on emerging architectures.☆14Jun 1, 2022Updated 3 years ago
- Crellvm: Verified Credible Compilation for LLVM☆18Jun 26, 2018Updated 7 years ago
- UPP is a minimalist and generic text preprocessor using Lua macros.☆13Oct 13, 2024Updated last year
- Inference on the TVM runtime using C++ with GPU enabled☆10Apr 25, 2018Updated 7 years ago
- Artifact for 'Register Optimizations for Stencils on GPUs'☆10Sep 18, 2018Updated 7 years ago
- JUPITER Benchmark Suite☆23Jul 18, 2025Updated 8 months ago
- ☆10Jun 18, 2024Updated last year
- Cayley-Dickson algebra implementation in Python☆12Jan 3, 2019Updated 7 years ago
- Canopy is a machine learning compiler stack with the capability of adopting high-end FPGAs. As a part of the OpenAIOS project, Canop…☆12May 7, 2021Updated 4 years ago
- Python GUI for differential forms☆13Oct 14, 2023Updated 2 years ago
- ☆17Nov 13, 2019Updated 6 years ago
- ☆10Dec 8, 2021Updated 4 years ago
- A compact hash algorithm for CPUs and GPUs using OpenCL☆15Sep 26, 2020Updated 5 years ago