mlcommons / inference_results_v0.7
This repository contains the results and code for the MLPerf™ Inference v0.7 benchmark.
☆17Updated last year
Related projects:
- Benchmark scripts for TVM☆73Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.5 benchmark.☆55Updated last year
- System for automated integration of deep learning backends.☆48Updated 2 years ago
- ☆66Updated last year
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆36Updated 6 months ago
- tophub autotvm log collections☆70Updated last year
- This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.☆56Updated last year
- This repository contains the results and code for the MLPerf™ Inference v1.0 benchmark.☆30Updated last year
- Inference of quantization aware trained networks using TensorRT☆77Updated last year
- Benchmark code for the "Online normalizer calculation for softmax" paper☆52Updated 6 years ago
- Benchmark of TVM quantized model on CUDA☆112Updated 4 years ago
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆55Updated 2 months ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆63Updated 6 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆69Updated 7 years ago
- A self-contained version of the tutorial which can be easily cloned and viewed by others.☆26Updated 5 years ago
- Python bindings for NVTX☆66Updated last year
- To make it easy to benchmark AI accelerators☆179Updated last year
- Subpart source code of deepcore v0.7☆27Updated 4 years ago
- Issues related to MLPerf™ training policies, including rules and suggested changes☆92Updated last week
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆191Updated 2 years ago
- TensorFlow and TVM integration☆38Updated 4 years ago
- ☆26Updated last year
- Repository for SysML19 Artifacts Evaluation☆53Updated 5 years ago
- An analytical performance modeling tool for deep neural networks.☆85Updated 3 years ago
- A home for the final text of all TVM RFCs.☆99Updated 3 months ago
- ☆34Updated 2 years ago
- Code for testing the native float16 matrix multiplication performance on Tesla P100 and V100 GPU based on cublasHgemm☆34Updated 5 years ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆112Updated 2 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆74Updated last year
- ☆18Updated this week