mlcommons / power-devLinks
Dev repo for power measurement for the MLPerf™ benchmarks
☆24Updated 4 months ago
Alternatives and similar repositories for power-dev
Users that are interested in power-dev are comparing it to the libraries listed below
Sorting:
- oneCCL Bindings for Pytorch*☆99Updated this week
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆96Updated this week
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆63Updated last week
- Issues related to MLPerf™ training policies, including rules and suggested changes☆95Updated last week
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆158Updated last month
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆37Updated last year
- Python bindings for NVTX☆66Updated 2 years ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆346Updated this week
- distributed-embeddings is a library for building large embedding based models in Tensorflow 2.☆44Updated last year
- A tensor-aware point-to-point communication primitive for machine learning☆260Updated this week
- PyTorch RFCs (experimental)☆134Updated 2 months ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆61Updated last month
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆149Updated this week
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆167Updated 2 weeks ago
- General policies for MLPerf™ including submission rules, coding standards, etc.☆30Updated last week
- This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.☆57Updated 2 years ago
- ☆144Updated 6 months ago
- MLIR-based partitioning system☆115Updated this week
- Research and development for optimizing transformers☆129Updated 4 years ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆85Updated last year
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆196Updated last week
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.☆157Updated 10 months ago
- WholeGraph - large scale Graph Neural Networks☆104Updated 8 months ago
- A schedule language for large model training☆149Updated last year
- ☆251Updated last year
- Benchmarks to capture important workloads.☆31Updated 6 months ago
- Stores documents and resources used by the OpenXLA developer community☆126Updated last year
- A library to analyze PyTorch traces.☆402Updated this week
- Training neural networks in TensorFlow 2.0 with 5x less memory☆132Updated 3 years ago