mlcommons / policiesLinks

General policies for MLPerf™ including submission rules, coding standards, etc.

☆30

Alternatives and similar repositories for policies

Users that are interested in policies are comparing it to the libraries listed below

Sorting:

pytorch / test-infra
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …
☆96Updated this week
mlcommons / logging
MLPerf™ logging library
☆36Updated last week
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆113Updated 2 years ago
mlcommons / power-dev
Dev repo for power measurement for the MLPerf™ benchmarks
☆24Updated 4 months ago
NVIDIA / mlperf-common
NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions
☆29Updated 2 weeks ago
onnx / steering-committee
Notes and artifacts from the ONNX steering committee
☆26Updated last week
mlcommons / inference_policies
Issues related to MLPerf™ Inference policies, including rules and suggested changes
☆63Updated last week
pytorch / torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…
☆381Updated this week
facebookresearch / param
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…
☆149Updated this week
mlcommons / mlcube
MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.
☆157Updated 10 months ago
pytorch / torchsnapshot
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…
☆158Updated last month
intel / torch-ccl
oneCCL Bindings for Pytorch*
☆99Updated this week
CentML / DeepView.Profile
🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
☆64Updated 6 months ago
coreweave / ml-containers
☆38Updated this week
HabanaAI / Model-References
Reference models for Intel(R) Gaudi(R) AI Accelerator
☆167Updated 2 weeks ago
run-ai / rntop
A top-like tool for monitoring GPUs in a cluster
☆85Updated last year
pytorch / rfcs
PyTorch RFCs (experimental)
☆134Updated 2 months ago
NVIDIA / Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
☆346Updated this week
mlcommons / training_policies
Issues related to MLPerf™ training policies, including rules and suggested changes
☆95Updated last week
GoogleCloudPlatform / slurm-gcp
☆51Updated 3 weeks ago
fmperf-project / fmperf
Cloud Native Benchmarking of Foundation Models
☆39Updated last week
gpuopenanalytics / pynvml
Provide Python access to the NVML library for GPU diagnostics
☆243Updated 8 months ago
mlcommons / training_results_v1.0
This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.
☆37Updated last year
NVIDIA / nvidia-resiliency-ext
NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …
☆196Updated last week
facebookresearch / HolisticTraceAnalysis
A library to analyze PyTorch traces.
☆402Updated this week
facebookresearch / FAMBench
Benchmarks to capture important workloads.
☆31Updated 6 months ago
GoogleCloudPlatform / ml-testing-accelerators
Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
☆64Updated last month
groq / mlagility
Machine Learning Agility (MLAgility) benchmark and benchmarking tools
☆39Updated last week
NVIDIA / nvtx-plugins
Python bindings for NVTX
☆66Updated 2 years ago
rapidsai / ucx-py
Python bindings for UCX
☆137Updated this week