mlcommons / cm4mlops
A collection of portable, reusable and cross-platform CM automations for MLOps and MLPerf to simplify the process of building, benchmarking and optimizing AI systems across diverse models, data sets, software and hardware
☆16Updated this week
Alternatives and similar repositories for cm4mlops:
Users that are interested in cm4mlops are comparing it to the libraries listed below
- This repository is outdated! Join the open MLPerf workgroup to participate in the development of the next generation of automation workfl…☆32Updated 2 years ago
- ☆18Updated last month
- A collection of portable workflows, automation recipes and components for MLOps in a unified CK format. Note that this repository is outd…☆18Updated last month
- Tools to deploy GPU clusters in the Cloud☆30Updated last year
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆58Updated last month
- ☆34Updated this week
- MLPerf™ logging library☆32Updated last week
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆66Updated this week
- Large Language Model Text Generation Inference on Habana Gaudi☆29Updated this week
- Collective Knowledge repository to support artifact evaluation and reproducibility initiatives:☆52Updated 5 months ago
- A tracing infrastructure for heterogeneous computing applications.☆26Updated this week
- Bandwidth test for ROCm☆52Updated this week
- This repository contains the results and code for the MLPerf™ Training v3.0 benchmark.☆12Updated last year
- A multi-platform experimentation framework written in python.☆43Updated this week
- Intel® SHMEM - Device initiated shared memory based communication library☆22Updated 2 months ago
- ☆41Updated 3 weeks ago
- OpenAI Triton backend for Intel® GPUs☆154Updated this week
- oneAPI Technical Advisory Board (TAB) Meeting Notes☆72Updated 11 months ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆165Updated this week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆48Updated this week
- Apollo: Online Machine Learning for Performance Portability☆22Updated 4 months ago
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆31Updated this week
- Benchmarks to capture important workloads.☆29Updated this week
- RCCL Performance Benchmark Tests☆55Updated this week
- Analyze parallel execution traces using pandas dataframes☆22Updated 3 weeks ago
- CloudAI Benchmark Framework☆47Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆47Updated this week
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆128Updated last week
- This repository contains the results and code for the MLPerf™ Inference v3.1 benchmark.☆11Updated 3 months ago
- An HPL-AI implementation for Fugaku☆19Updated 3 years ago