pytorch / benchmarkLinks
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
☆1,002Updated this week
Alternatives and similar repositories for benchmark
Users that are interested in benchmark are comparing it to the libraries listed below
Sorting:
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.☆914Updated this week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,071Updated last year
- A GPU performance profiling tool for PyTorch models☆509Updated 4 years ago
- Pipeline Parallelism for PyTorch☆783Updated last year
- Continuous builder and binary build scripts for pytorch☆356Updated 5 months ago
- Benchmark Suite for Deep Learning☆280Updated 3 weeks ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,516Updated this week
- A GPipe implementation in PyTorch☆861Updated last year
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆411Updated this week
- common in-memory tensor structure☆1,139Updated last month
- PyTorch extensions for high performance and large scale training.☆3,393Updated 8 months ago
- Collective communications library with various primitives for multi-machine training.☆1,386Updated this week
- A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.☆1,246Updated this week
- A library to analyze PyTorch traces.☆454Updated last month
- An open-source efficient deep learning framework/compiler, written in python.☆739Updated 4 months ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,739Updated 3 weeks ago
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H…☆3,081Updated this week
- Reference implementations of MLPerf® training benchmarks☆1,736Updated last month
- C++ extensions in PyTorch☆1,177Updated 6 months ago
- Reference implementations of MLPerf® inference benchmarks☆1,511Updated this week
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,062Updated 2 years ago
- functorch is JAX-like composable function transforms for PyTorch.☆1,437Updated 4 months ago
- Using the famous cnn model in Pytorch, we run benchmarks on various gpu.☆247Updated last year
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,920Updated this week
- Profiling and inspecting memory in pytorch☆1,076Updated 4 months ago
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,585Updated last year
- A tensor-aware point-to-point communication primitive for machine learning☆283Updated 3 weeks ago
- NCCL Tests☆1,401Updated last week
- Provide Python access to the NVML library for GPU diagnostics☆257Updated 4 months ago
- Tutel MoE: Optimized Mixture-of-Experts Library, Support GptOss/DeepSeek/Kimi-K2/Qwen3 using FP8/NVFP4/MXFP4☆955Updated 3 weeks ago