vra / flopthLinks
A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.
☆129Updated 8 months ago
Alternatives and similar repositories for flopth
Users that are interested in flopth are comparing it to the libraries listed below
Sorting:
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 3 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆214Updated 2 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- [ICLR 2022] "Deep AutoAugment" by Yu Zheng, Zhi Zhang, Shen Yan, Mi Zhang☆65Updated 10 months ago
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch☆251Updated 2 years ago
- Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.)☆221Updated 4 months ago
- Optimized library for large-scale extraction of frames and audio from video.☆204Updated last year
- ☆51Updated 2 years ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆100Updated last year
- ☆74Updated 2 years ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆124Updated last year
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated last year
- Code release for "Dropout Reduces Underfitting"☆314Updated 2 years ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆120Updated 9 months ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆184Updated 3 months ago
- Official implementation of "Active Image Indexing"☆59Updated 2 years ago
- A simple minimal implementation of Reversible Vision Transformers☆125Updated last year
- Pytorch cyclic cosine decay learning rate scheduler☆49Updated 4 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆204Updated last year
- Implementation of Fast Transformer in Pytorch☆175Updated 3 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆73Updated 2 years ago
- Examples for the WebDataset PyTorch Dataset Library☆50Updated 4 years ago
- An unopinionated replacement for PyTorch's Dataset and ImageFolder, that handles Tar archives☆77Updated 3 years ago
- Deep Learning project template for PyTorch (multi-gpu training is supported)☆139Updated last year
- Recent Advances in MLP-based Models (MLP is all you need!)☆116Updated 2 years ago
- Implementation of Linformer for Pytorch☆295Updated last year
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆112Updated 3 years ago
- Figures I made during my PhD in Deep Learning, for my models and for context☆83Updated 4 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆225Updated 3 years ago
- PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)☆144Updated 2 years ago