mlcommons / power-devLinks
Dev repo for power measurement for the MLPerf™ benchmarks
☆26Updated 4 months ago
Alternatives and similar repositories for power-dev
Users that are interested in power-dev are comparing it to the libraries listed below
Sorting:
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- A tensor-aware point-to-point communication primitive for machine learning☆283Updated last month
- oneCCL Bindings for Pytorch* (deprecated)☆104Updated last month
- Issues related to MLPerf® Inference policies, including rules and suggested changes☆63Updated this week
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆164Updated 3 weeks ago
- Issues related to MLPerf™ training policies, including rules and suggested changes☆95Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆380Updated this week
- Python bindings for NVTX☆67Updated 2 years ago
- PyTorch RFCs (experimental)☆138Updated 8 months ago
- distributed-embeddings is a library for building large embedding based models in Tensorflow 2.☆46Updated 2 years ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆64Updated 7 months ago
- MLPerf™ logging library☆38Updated last month
- Research and development for optimizing transformers☆131Updated 4 years ago
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆170Updated 3 weeks ago
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆104Updated this week
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆36Updated last year
- A library to analyze PyTorch traces.☆462Updated this week
- General policies for MLPerf® benchmarks including submission rules, coding standards, etc.☆31Updated 2 weeks ago
- ☆145Updated last year
- ☆252Updated last year
- This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.☆57Updated 2 years ago
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆255Updated this week
- The Foundation for All Legate Libraries☆233Updated last week
- ☆74Updated this week
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆156Updated this week
- Simple Distributed Deep Learning on TensorFlow☆134Updated 7 months ago
- ☆72Updated this week
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆412Updated this week
- Benchmarks to capture important workloads.☆32Updated 2 weeks ago
- JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training☆63Updated 2 weeks ago