mlcommons / inference_results_v3.0
This repository contains the results and code for the MLPerf™ Inference v3.0 benchmark.
☆19Updated 3 weeks ago
Alternatives and similar repositories for inference_results_v3.0
Users who are interested in inference_results_v3.0 are comparing it to the repositories listed below.
- COCCL: Compression and precision co-aware collective communication library☆24Updated 4 months ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆61Updated 3 weeks ago
- ☆49Updated this week
- ☆28Updated 6 months ago
- oneCCL Bindings for Pytorch*☆99Updated last week
- ParaDnn: A systematic performance analysis methodology for deep learning.☆39Updated 5 years ago
- RCCL Performance Benchmark Tests☆70Updated this week
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆51Updated 7 years ago
- Parser script that processes PyTorch autograd profiler results and converts the JSON file to Excel.☆14Updated 5 years ago
- Research and development for optimizing transformers☆129Updated 4 years ago
- A CUTLASS implementation using SYCL☆32Updated last week
- ☆251Updated 11 months ago
- Sparsity support for PyTorch☆35Updated 4 months ago
- Python bindings for UCX☆137Updated this week
- This repository contains the results and code for the MLPerf™ Training v2.1 benchmark.☆15Updated last year
- A hierarchical collective communications library with portable optimizations☆35Updated 7 months ago
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 3 years ago
- Benchmarks to capture important workloads.☆31Updated 5 months ago
- Reference implementations of MLPerf™ HPC training benchmarks☆48Updated 4 months ago
- Provides examples for writing and building Habana custom kernels using HabanaTools☆22Updated 3 months ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆40Updated last year
- MLIR-based partitioning system☆107Updated this week
- A Data-Centric Compiler for Machine Learning☆84Updated last year
- ☆16Updated 2 years ago
- ☆18Updated 5 years ago
- Ahead of Time (AOT) Triton Math Library☆72Updated last week
- Bandwidth test for ROCm☆60Updated last week
- A library of GPU kernels for sparse matrix operations.☆270Updated 4 years ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆147Updated 2 weeks ago
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Updated 3 years ago