mlcommons / inference_policiesLinks

Issues related to MLPerf™ Inference policies, including rules and suggested changes

☆63

Alternatives and similar repositories for inference_policies

Users that are interested in inference_policies are comparing it to the libraries listed below

Sorting:

mlcommons / training_policies
Issues related to MLPerf™ training policies, including rules and suggested changes
☆95Updated this week
intel / torch-ccl
oneCCL Bindings for Pytorch*
☆99Updated this week
NVIDIA / nvtx-plugins
Python bindings for NVTX
☆66Updated 2 years ago
mlcommons / training_results_v1.0
This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.
☆38Updated last year
andersy005 / tvm-in-action
TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together
☆64Updated 7 years ago
yalue / cuda_scheduling_examiner_mirror
A tool for examining GPU scheduling behavior.
☆86Updated 11 months ago
tbd-ai / tbd-suite
☆47Updated 2 years ago
awslabs / lorien
☆43Updated last year
masahi / torchscript-to-tvm
☆69Updated 2 years ago
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆113Updated 2 years ago
facebookresearch / param
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…
☆147Updated last week
uxlfoundation / oneCCL
oneAPI Collective Communications Library (oneCCL)
☆241Updated this week
ROCm / rccl-tests
RCCL Performance Benchmark Tests
☆71Updated last week
Intel-tensorflow / tensorflow
Computation using data flow graphs for scalable machine learning
☆68Updated this week
NVIDIA / compute-sanitizer-samples
Samples demonstrating how to use the Compute Sanitizer Tools and Public API
☆85Updated last year
facebookresearch / FAMBench
Benchmarks to capture important workloads.
☆31Updated 6 months ago
tlc-pack / TLCBench
Benchmark scripts for TVM
☆75Updated 3 years ago
apache / tvm-rfcs
A home for the final text of all TVM RFCs.
☆105Updated 10 months ago
uwsampl / tutorial
A self-contained version of the tutorial which can be easily cloned and viewed by others.
☆24Updated 6 years ago
Emma926 / paradnn
ParaDnn: A systematic performance analysis methodology for deep learning.
☆39Updated 5 years ago
intel / intel-extension-for-deepspeed
Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…
☆61Updated last month
cmu-catalyst / collage
System for automated integration of deep learning backends.
☆47Updated 2 years ago
intel / intel-xpu-backend-for-triton
OpenAI Triton backend for Intel® GPUs
☆197Updated this week
GVProf / GVProf
GVProf: A Value Profiler for GPU-based Clusters
☆51Updated last year
thu-pacman / PET
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆122Updated 3 years ago
buaa-hipo / dlcompiler-comparison
The quantitative performance comparison among DL compilers on CNN models.
☆74Updated 4 years ago
awslabs / raf
☆144Updated 6 months ago
parasj / checkmate
Training neural networks in TensorFlow 2.0 with 5x less memory
☆132Updated 3 years ago
openxla / shardy
MLIR-based partitioning system
☆115Updated this week
mlcommons / training_results_v0.7
This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.
☆57Updated 2 years ago