mlcommons / training
Reference implementations of MLPerf™ training benchmarks
☆1,658 Updated this week
Alternatives and similar repositories for training:
Users who are interested in training compare it to the libraries listed below.
- Reference implementations of MLPerf™ inference benchmarks ☆1,351 Updated this week
- Collective communications library with various primitives for multi-machine training. ☆1,288 Updated this week
- Benchmarking Deep Learning operations on different hardware ☆1,082 Updated 3 years ago
- A benchmark framework for Tensorflow ☆1,151 Updated last year
- nGraph has moved to OpenVINO ☆1,350 Updated 4 years ago
- A domain specific language to express machine learning workloads. ☆1,759 Updated last year
- ☆580 Updated 7 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,293 Updated this week
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance. ☆932 Updated this week
- ☆1,659 Updated 6 years ago
- Mesh TensorFlow: Model Parallelism Made Easier ☆1,605 Updated last year
- NCCL Tests ☆1,059 Updated 3 weeks ago
- PyTorch elastic training ☆730 Updated 2 years ago
- Dive into Deep Learning Compiler ☆646 Updated 2 years ago
- ☆372 Updated 7 years ago
- A performant and modular runtime for TensorFlow ☆759 Updated last month
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description. ☆982 Updated 6 months ago
- Low-precision matrix multiplication ☆1,798 Updated last year
- Compiler for Neural Network hardware accelerators ☆3,278 Updated 11 months ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution ☆1,038 Updated last year
- A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning. ☆1,530 Updated 2 months ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators ☆1,538 Updated 5 years ago
- "Multi-Level Intermediate Representation" Compiler Infrastructure ☆1,743 Updated 3 years ago
- DAWNBench: An End-to-End Deep Learning Benchmark and Competition ☆261 Updated 4 years ago
- TensorFlow/TensorRT integration ☆740 Updated last year
- Optimized primitives for collective multi-GPU communication ☆3,641 Updated 2 weeks ago
- oneAPI Deep Neural Network Library (oneDNN) ☆3,769 Updated this week
- A GPipe implementation in PyTorch ☆835 Updated 8 months ago
- To make it easy to benchmark AI accelerators ☆183 Updated 2 years ago
- Facebook AI Performance Evaluation Platform ☆390 Updated 2 months ago