zugexiaodui / torch_flopsLinks
A library for calculating the FLOPs in the forward() process based on torch.fx
☆115Updated 2 months ago
Alternatives and similar repositories for torch_flops
Users that are interested in torch_flops are comparing it to the libraries listed below
Sorting:
- ☆191Updated last year
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆90Updated last year
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".☆247Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"☆335Updated 6 months ago
- Awesome list of papers that extend Mamba to various applications.☆133Updated last week
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆70Updated last year
- [NeurIPS 2023] Structural Pruning for Diffusion Models☆197Updated 11 months ago
- Curated list of methods that focuses on improving the efficiency of diffusion models☆45Updated 11 months ago
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…☆60Updated last year
- Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)☆139Updated 2 years ago
- Causal depthwise conv1d in CUDA, with a PyTorch interface☆494Updated 3 weeks ago
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆92Updated 2 years ago
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores☆319Updated 5 months ago
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch☆298Updated 2 months ago
- Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficien…☆108Updated 2 months ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆223Updated last year
- Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models☆313Updated 3 months ago
- [ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆96Updated last year
- XAttention: Block Sparse Attention with Antidiagonal Scoring☆164Updated 2 weeks ago
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆472Updated 4 months ago
- Recent Advances on Efficient Vision Transformers☆51Updated 2 years ago
- Offical implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS2024 Oral)☆25Updated 5 months ago
- 🔥 A minimal training framework for scaling FLA models☆170Updated last week
- [ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule☆173Updated 3 months ago
- Collection of papers on state-space models☆595Updated last month
- Open source implementation of "Vision Transformers Need Registers"☆180Updated 2 months ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆70Updated 6 months ago
- Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022)☆158Updated 2 years ago
- (Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from …☆170Updated last year
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆138Updated 4 months ago