mlcommons / cm4mlopsLinks
Legacy CM repository with a collection of portable, reusable and cross-platform CM automations for MLOps and MLPerf to simplify the process of building, benchmarking and optimizing AI systems across diverse models, data sets, software and hardware
☆18Updated 2 months ago
Alternatives and similar repositories for cm4mlops
Users that are interested in cm4mlops are comparing it to the libraries listed below
Sorting:
- A collection of portable workflows, automation recipes and components for MLOps in a unified CK format. Note that this repository is outd…☆18Updated 5 months ago
- Collective Knowledge repository to support artifact evaluation and reproducibility initiatives:☆55Updated 2 months ago
- AMD HPC Research Fund Cloud☆13Updated 3 weeks ago
- This repository is outdated! Join the open MLPerf workgroup to participate in the development of the next generation of automation workfl…☆32Updated 2 years ago
- Portable and customizable Collective Knowledge workflows for TVM and VTA:☆16Updated 3 years ago
- ☆19Updated this week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆86Updated last week
- RCCL Performance Benchmark Tests☆67Updated 2 weeks ago
- General policies for MLPerf™ including submission rules, coding standards, etc.☆28Updated this week
- COCCL: Compression and precision co-aware collective communication library☆22Updated 2 months ago
- CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:☆13Updated 6 years ago
- CK repository with components and automation actions to enable portable workflows across diverse platforms including Linux, Windows, MacO…☆72Updated 2 years ago
- Dev repo for power measurement for the MLPerf™ benchmarks☆22Updated last month
- This repository contains the results and code for the MLPerf™ Inference v3.1 benchmark.☆11Updated 7 months ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆186Updated this week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆173Updated last week
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆173Updated this week
- Tools to deploy GPU clusters in the Cloud☆31Updated 2 years ago
- A multi-platform experimentation framework written in python.☆55Updated this week
- Validated Collective Knowledge workflows and results from the 1st ACM ReQuEST tournament on co-design of Pareto-efficient SW/HW stack for…☆13Updated 6 years ago
- Reference implementations of MLPerf™ HPC training benchmarks☆48Updated 3 months ago
- Ongoing research training transformer models at scale☆22Updated last week
- Multi-GPU communication profiler and visualizer☆29Updated 11 months ago
- ☆46Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆56Updated last month
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆61Updated 3 months ago
- ☆20Updated 2 months ago
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆62Updated 3 months ago
- Apollo: Online Machine Learning for Performance Portability☆23Updated 9 months ago
- Issues related to MLPerf™ training policies, including rules and suggested changes☆95Updated last month