mlcommons / power-dev
Dev repo for power measurement for the MLPerf™ benchmarks
☆20Updated last week
Alternatives and similar repositories for power-dev:
Users that are interested in power-dev are comparing it to the libraries listed below
- Home for OctoML PyTorch Profiler☆112Updated last year
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆62Updated last month
- Python bindings for NVTX☆66Updated last year
- distributed-embeddings is a library for building large embedding based models in Tensorflow 2.☆44Updated last year
- oneCCL Bindings for Pytorch*☆94Updated last week
- This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.☆56Updated last year
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆317Updated this week
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆136Updated this week
- MLIR-based partitioning system☆80Updated this week
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆38Updated last year
- ☆39Updated 4 months ago
- A schedule language for large model training☆146Updated 10 months ago
- Issues related to MLPerf™ training policies, including rules and suggested changes☆94Updated 2 weeks ago
- Benchmarks to capture important workloads.☆31Updated 2 months ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆116Updated last year
- Research and development for optimizing transformers☆125Updated 4 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆130Updated 3 years ago
- ☆163Updated 10 months ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆119Updated 2 years ago
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆146Updated this week
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆156Updated 4 months ago
- Shared Middle-Layer for Triton Compilation☆245Updated this week
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆216Updated 6 months ago
- ☆141Updated 2 months ago
- ☆16Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆62Updated last month
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 2 years ago
- A tensor-aware point-to-point communication primitive for machine learning☆256Updated 2 years ago
- MLPerf™ logging library☆34Updated last week
- A library to analyze PyTorch traces.☆366Updated last week