mlcommons / inference_policies
Issues related to MLPerf® Inference policies, including rules and suggested changes
☆64Updated 3 weeks ago
Alternatives and similar repositories for inference_policies
Users who are interested in inference_policies are comparing it to the repositories listed below
- oneCCL Bindings for Pytorch* (deprecated)☆103Updated last month
- Issues related to MLPerf™ training policies, including rules and suggested changes☆95Updated 2 months ago
- Python bindings for NVTX☆67Updated 2 years ago
- ☆47Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆36Updated last year
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆153Updated last week
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆90Updated 2 years ago
- A tool for examining GPU scheduling behavior.☆89Updated last year
- oneAPI Collective Communications Library (oneCCL)☆248Updated this week
- ☆68Updated 2 years ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 7 years ago
- Splits a single NVIDIA GPU into multiple partitions with complete compute and memory isolation (with respect to performance) between the partitions☆164Updated 6 years ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆63Updated 5 months ago
- Benchmark scripts for TVM☆74Updated 3 years ago
- Training material for Nsight developer tools☆173Updated last year
- RCCL Performance Benchmark Tests☆81Updated last week
- ParaDnn: A systematic performance analysis methodology for deep learning.☆40Updated 5 years ago
- ☆42Updated 2 years ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆121Updated 3 years ago
- Home for OctoML PyTorch Profiler☆114Updated 2 years ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆130Updated this week
- A home for the final text of all TVM RFCs.☆108Updated last year
- Automated machine learning as an AI-HPC benchmark☆65Updated 3 years ago
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆279Updated 3 months ago
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆61Updated 8 months ago
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions☆35Updated 2 months ago
- Synthesizer for optimal collective communication algorithms☆121Updated last year
- System for automated integration of deep learning backends.☆47Updated 3 years ago
- Magnum IO community repo☆104Updated this week
- Fine-grained GPU sharing primitives☆147Updated 4 months ago