mlcommons / trainingLinks
Reference implementations of MLPerf® training benchmarks
☆1,716Updated 3 weeks ago
Alternatives and similar repositories for training
Users that are interested in training are comparing it to the libraries listed below
Sorting:
- A benchmark framework for Tensorflow☆1,148Updated 2 years ago
- Reference implementations of MLPerf™ inference benchmarks☆1,471Updated last week
- Collective communications library with various primitives for multi-machine training.☆1,360Updated last month
- Benchmarking Deep Learning operations on different hardware☆1,096Updated 4 years ago
- nGraph has moved to OpenVINO☆1,342Updated 5 years ago
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.☆987Updated this week
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,449Updated last week
- Optimized primitives for collective multi-GPU communication☆4,134Updated 3 weeks ago
- ☆598Updated 7 years ago
- A domain specific language to express machine learning workloads.☆1,760Updated 2 years ago
- NCCL Tests☆1,293Updated 2 weeks ago
- ☆393Updated 2 years ago
- Compiler for Neural Network hardware accelerators☆3,311Updated last year
- Mesh TensorFlow: Model Parallelism Made Easier☆1,617Updated last year
- PyTorch elastic training☆730Updated 3 years ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,061Updated 2 years ago
- A performant and modular runtime for TensorFlow☆760Updated last month
- A GPipe implementation in PyTorch☆856Updated last year
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,686Updated this week
- common in-memory tensor structure☆1,077Updated last month
- ☆371Updated 7 years ago
- Make huge neural nets fit in memory☆2,814Updated 5 years ago
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,531Updated this week
- Issues related to MLPerf™ training policies, including rules and suggested changes☆95Updated 3 weeks ago
- Low-precision matrix multiplication☆1,815Updated last year
- ☆1,655Updated 7 years ago
- Dive into Deep Learning Compiler☆646Updated 3 years ago
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.☆879Updated 2 weeks ago
- oneAPI Deep Neural Network Library (oneDNN)☆3,897Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla…☆2,799Updated this week