mlcommons / policies
General policies for MLPerf™ including submission rules, coding standards, etc.
☆31 Updated last week
Alternatives and similar repositories for policies
Users who are interested in policies compare it to the repositories listed below
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic … ☆102 Updated this week
- MLPerf™ logging library ☆37 Updated this week
- Dev repo for power measurement for the MLPerf™ benchmarks ☆24 Updated last month
- Home for OctoML PyTorch Profiler ☆114 Updated 2 years ago
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions ☆32 Updated last month
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆394 Updated this week
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… ☆161 Updated 3 weeks ago
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible. ☆157 Updated last year
- Issues related to MLPerf™ Inference policies, including rules and suggested changes ☆64 Updated last month
- Notes and artifacts from the ONNX steering committee ☆26 Updated this week
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for… ☆152 Updated this week
- PyTorch RFCs (experimental) ☆135 Updated 4 months ago
- A top-like tool for monitoring GPUs in a cluster ☆85 Updated last year
- A library to analyze PyTorch traces. ☆416 Updated last week
- Cloud Native Benchmarking of Foundation Models ☆44 Updated 2 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆357 Updated this week
- oneCCL Bindings for Pytorch* ☆102 Updated 2 months ago
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU) ☆65 Updated 3 months ago
- Benchmarks to capture important workloads. ☆31 Updated 8 months ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel… ☆381 Updated 4 months ago
- Issues related to MLPerf™ training policies, including rules and suggested changes ☆95 Updated 2 weeks ago
- Measure and optimize the energy consumption of your AI applications! ☆296 Updated last week
- ☆56 Updated this week
- A tensor-aware point-to-point communication primitive for machine learning ☆273 Updated last month
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the … ☆226 Updated last week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications. ☆185 Updated last week
- ☆28 Updated 3 months ago
- ☆145 Updated last week
- The Triton backend for the PyTorch TorchScript models. ☆160 Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note… ☆63 Updated 3 months ago