vra / flopth
A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.
☆131 Updated 11 months ago
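For readers unfamiliar with what such a tool reports, the following is a minimal Python sketch of the standard per-layer formulas for parameter counts and multiply-accumulate operations (MACs) in a fully connected layer and a 2D convolution. It is not flopth's implementation or API. The function names `linear_counts` and `conv2d_counts` are hypothetical, and tools differ on whether one MAC is reported as one or two FLOPs.

```python
def linear_counts(in_features, out_features, bias=True):
    """Return (params, macs) for a fully connected layer on one input vector."""
    weights = in_features * out_features
    params = weights + (out_features if bias else 0)
    # One multiply-accumulate per weight; bias additions are ignored here.
    macs = weights
    return params, macs


def conv2d_counts(in_ch, out_ch, kernel, out_h, out_w, bias=True, groups=1):
    """Return (params, macs) for a 2D convolution on one input sample.

    out_h and out_w are the spatial dimensions of the OUTPUT feature map,
    which the caller must compute from the input size, stride, and padding.
    """
    kh, kw = kernel
    weights = out_ch * (in_ch // groups) * kh * kw
    params = weights + (out_ch if bias else 0)
    # Every weight is applied once per output position.
    macs = weights * out_h * out_w
    return params, macs


if __name__ == "__main__":
    # Illustrative layer shapes, chosen here as an assumption.
    print(linear_counts(512, 10))
    print(conv2d_counts(3, 64, (7, 7), 112, 112, bias=False))
```

Under these formulas, a 512-to-10 linear layer with bias has 5130 parameters and 5120 MACs, and a bias-free 7x7 convolution from 3 to 64 channels with a 112x112 output has 9408 parameters and 118013952 MACs.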
Alternatives and similar repositories for flopth
Users who are interested in flopth are comparing it to the libraries listed below.
- Implementation of fused cosine similarity attention in the same style as Flash Attention ☆217 Updated 2 years ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides. ☆124 Updated 3 years ago
- Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.) ☆222 Updated 7 months ago
- [ICLR 2022] "Deep AutoAugment" by Yu Zheng, Zhi Zhang, Shen Yan, Mi Zhang ☆65 Updated last year
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch ☆252 Updated 3 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021). ☆228 Updated 3 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs… ☆183 Updated 5 months ago
- Transformers w/o Attention, based fully on MLPs ☆95 Updated last year
- Estimate/count FLOPS for a given neural network using pytorch ☆306 Updated 3 years ago
- ☆52 Updated 2 years ago
- Code release for "Dropout Reduces Underfitting" ☆315 Updated 2 years ago
- Recent Advances in MLP-based Models (MLP is all you need!) ☆117 Updated 2 years ago
- Pytorch cyclic cosine decay learning rate scheduler ☆49 Updated 4 years ago
- Examples for the WebDataset PyTorch Dataset Library ☆51 Updated 4 years ago
- Implementation of ResMLP, an all MLP solution to image classification, in Pytorch ☆200 Updated 2 years ago
- ☆75 Updated 2 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi… ☆51 Updated 3 years ago
- Implementation of Online Label Smoothing in PyTorch ☆95 Updated 3 years ago
- A simple minimal implementation of Reversible Vision Transformers ☆126 Updated last year
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch ☆110 Updated 2 months ago
- Implementation of Fast Transformer in Pytorch ☆177 Updated 4 years ago
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac… ☆242 Updated last month
- Context Manager to profile the forward and backward times of PyTorch's nn.Module ☆82 Updated 2 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de… ☆102 Updated 3 years ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new… ☆123 Updated last year
- Unofficial PyTorch implementation of Google's FNet: Mixing Tokens with Fourier Transforms. With checkpoints. ☆77 Updated 3 years ago
- Adaptive Gradient Clipping ☆151 Updated 3 years ago
- Pytorch reimplementation of the Mixer (MLP-Mixer: An all-MLP Architecture for Vision) ☆36 Updated 4 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity" ☆74 Updated 2 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena ☆206 Updated 2 years ago