pytorch / benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
☆876Updated this week
Related projects ⓘ
Alternatives and complementary repositories for benchmark
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.☆732Updated last week
- A GPU performance profiling tool for PyTorch models☆495Updated 3 years ago
- A GPipe implementation in PyTorch☆818Updated 3 months ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,010Updated 7 months ago
- Pipeline Parallelism for PyTorch☆725Updated 3 months ago
- Continuous builder and binary build scripts for pytorch☆339Updated this week
- common in-memory tensor structure☆911Updated last month
- PyTorch extensions for high performance and large scale training.☆3,199Updated last week
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,490Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs…☆1,982Updated this week
- Profiling and inspecting memory in pytorch☆1,022Updated 3 months ago
- Benchmark Suite for Deep Learning☆250Updated 3 weeks ago
- Fast Block Sparse Matrices for Pytorch☆545Updated 3 years ago
- C++ extensions in PyTorch☆1,018Updated 3 months ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,210Updated this week
- Accelerate PyTorch models with ONNX Runtime☆356Updated 2 months ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆331Updated last week
- A library to analyze PyTorch traces.☆308Updated this week
- Slicing a PyTorch Tensor Into Parallel Shards☆296Updated 3 years ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,597Updated this week
- Using the famous cnn model in Pytorch, we run benchmarks on various gpu.☆227Updated 4 months ago
- Tutel MoE: An Optimized Mixture-of-Experts Implementation☆736Updated this week
- functorch is JAX-like composable function transforms for PyTorch.☆1,397Updated this week
- PyTorch elastic training☆730Updated 2 years ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆146Updated this week
- An open-source efficient deep learning framework/compiler, written in python.☆652Updated last week
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors.☆253Updated 2 years ago
- NCCL Tests☆898Updated 3 weeks ago
- Tutorial for building a custom CUDA function for Pytorch☆514Updated 5 years ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,027Updated last year