mlcommons / trainingLinks

Reference implementations of MLPerf™ training benchmarks

☆1,696

Alternatives and similar repositories for training

Users that are interested in training are comparing it to the libraries listed below

Sorting:

mlcommons / inference
Reference implementations of MLPerf™ inference benchmarks
☆1,426Updated this week
tensorflow / benchmarks
A benchmark framework for Tensorflow
☆1,150Updated last year
pytorch / gloo
Collective communications library with various primitives for multi-machine training.
☆1,332Updated this week
baidu-research / DeepBench
Benchmarking Deep Learning operations on different hardware
☆1,094Updated 4 years ago
pytorch / FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
☆1,415Updated this week
baidu-research / baidu-allreduce
☆588Updated 7 years ago
NVIDIA / nccl
Optimized primitives for collective multi-GPU communication
☆3,904Updated last week
NervanaSystems / ngraph
nGraph has moved to OpenVINO
☆1,347Updated 4 years ago
facebookresearch / TensorComprehensions
A domain specific language to express machine learning workloads.
☆1,760Updated 2 years ago
tensorflow / runtime
A performant and modular runtime for TensorFlow
☆758Updated 3 months ago
NVIDIA / nccl-tests
NCCL Tests
☆1,199Updated last week
msr-fiddle / pipedream
☆393Updated 2 years ago
tensorflow / mesh
Mesh TensorFlow: Model Parallelism Made Easier
☆1,613Updated last year
tensorflow / custom-op
Guide for building custom op for TensorFlow
☆382Updated 2 years ago
google / gemmlowp
Low-precision matrix multiplication
☆1,812Updated last year
pytorch / benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
☆966Updated last week
pytorch / glow
Compiler for Neural Network hardware accelerators
☆3,310Updated last year
jiazhihao / TASO
The Tensor Algebra SuperOptimizer for Deep Learning
☆726Updated 2 years ago
openai / blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
☆1,043Updated 2 years ago
dmlc / dlpack
common in-memory tensor structure
☆1,042Updated last month
d2l-ai / d2l-tvm
Dive into Deep Learning Compiler
☆646Updated 3 years ago
tensorflow / tensorrt
TensorFlow/TensorRT integration
☆743Updated last year
NVIDIA / TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla…
☆2,587Updated this week
alibaba / ai-matrix
To make it easy to benchmark AI accelerators
☆185Updated 2 years ago
uxlfoundation / oneDNN
oneAPI Deep Neural Network Library (oneDNN)
☆3,856Updated this week
mlcommons / training_policies
Issues related to MLPerf™ training policies, including rules and suggested changes
☆95Updated this week
kakaobrain / torchgpipe
A GPipe implementation in PyTorch
☆846Updated last year
pytorch / elastic
PyTorch elastic training
☆729Updated 3 years ago
baidu-research / tensorflow-allreduce
☆371Updated 7 years ago
pytorch / kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
☆842Updated last week