LukasHedegaard / pytorch-benchmark
Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption
☆92Updated last year
Related projects ⓘ
Alternatives and complementary repositories for pytorch-benchmark
- Neural Architecture Search for Neural Network Libraries☆57Updated 10 months ago
- Example code for profiler workshop☆29Updated 2 years ago
- ☆156Updated last year
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆146Updated this week
- ☆195Updated 3 years ago
- This repository contains the experimental PyTorch native float8 training UX☆212Updated 3 months ago
- Efficient CUDA kernels for training convolutional neural networks with PyTorch.☆35Updated this week
- Collection of SOTA efficient computer vision models for embedded applications, with pre-trained weights and training recipes☆82Updated this week
- Memory Optimizations for Deep Learning (ICML 2023)☆60Updated 8 months ago
- A code generator from ONNX to PyTorch code☆133Updated 2 years ago
- ☆123Updated last year
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆42Updated last year
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores☆281Updated last month
- ☆23Updated 4 months ago
- Demystify RAM Usage in Multi-Process Data Loaders☆183Updated last year
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆216Updated this week
- pytorch-profiler☆50Updated last year
- Torch Distributed Experimental☆116Updated 3 months ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆100Updated 11 months ago
- Profile PyTorch models for FLOPs and parameters, helping to evaluate computational efficiency and memory usage.☆20Updated this week
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated last year
- Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022)☆192Updated last year
- Applied AI experiments and examples for PyTorch☆168Updated 3 weeks ago
- A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.☆119Updated 11 months ago
- Dynamic Neural Architecture Search Toolkit☆29Updated 5 months ago
- Fast Hadamard transform in CUDA, with a PyTorch interface☆111Updated 6 months ago
- A custom pytorch Dataset extension that provides a faster iteration and better RAM usage☆42Updated 8 months ago
- ☆24Updated 7 months ago
- ☆134Updated last year