mlcommons / inference_policies
Issues related to MLPerf™ Inference policies, including rules and suggested changes
☆60Updated last month
Alternatives and similar repositories for inference_policies:
Users that are interested in inference_policies are comparing it to the libraries listed below
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆38Updated last year
- oneCCL Bindings for Pytorch*☆93Updated last week
- Issues related to MLPerf™ training policies, including rules and suggested changes☆94Updated last week
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 6 years ago
- Python bindings for NVTX☆66Updated last year
- RCCL Performance Benchmark Tests☆60Updated this week
- ☆69Updated 2 years ago
- Benchmarks to capture important workloads.☆31Updated 2 months ago
- System for automated integration of deep learning backends.☆48Updated 2 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆65Updated 3 years ago
- GVProf: A Value Profiler for GPU-based Clusters☆49Updated last year
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆77Updated last year
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆134Updated last week
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆119Updated 2 years ago
- Home for OctoML PyTorch Profiler☆109Updated last year
- A tool for examining GPU scheduling behavior.☆75Updated 7 months ago
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆57Updated 3 weeks ago
- This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.☆56Updated last year
- FTPipe and related pipeline model parallelism research.☆41Updated last year
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆62Updated last month
- ☆47Updated 2 years ago
- An IR for efficiently simulating distributed ML computation.☆28Updated last year
- Synthesizer for optimal collective communication algorithms☆105Updated last year
- A home for the final text of all TVM RFCs.☆102Updated 6 months ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆115Updated last year
- oneAPI Collective Communications Library (oneCCL)☆230Updated last week
- The quantitative performance comparison among DL compilers on CNN models.☆74Updated 4 years ago
- ☆27Updated 2 years ago
- Benchmark scripts for TVM☆74Updated 3 years ago