zugexiaodui / torch_flops
A library for calculating the FLOPs in the forward() process based on torch.fx
☆99Updated 6 months ago
Alternatives and similar repositories for torch_flops:
Users that are interested in torch_flops are comparing it to the libraries listed below
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆80Updated last year
- ☆188Updated last year
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆64Updated 10 months ago
- ☆40Updated last year
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆88Updated last year
- Causal depthwise conv1d in CUDA, with a PyTorch interface☆401Updated 3 months ago
- Curated list of methods that focuses on improving the efficiency of diffusion models☆38Updated 8 months ago
- Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)☆132Updated last year
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".☆248Updated last year
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…☆58Updated 9 months ago
- [ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation☆59Updated 2 weeks ago
- Awasome Papers and Resources in Deep Neural Network Pruning with Source Code.☆148Updated 6 months ago
- [NeurIPS 2023] Structural Pruning for Diffusion Models☆182Updated 8 months ago
- A sparse attention kernel supporting mix sparse patterns☆159Updated last month
- The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models☆96Updated last year
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆96Updated 6 months ago
- Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".☆112Updated last year
- Offical implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS2024 Oral)☆20Updated last month
- Awesome list of papers that extend Mamba to various applications.☆132Updated 2 months ago
- Recent Advances on Efficient Vision Transformers☆49Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"☆287Updated 2 months ago
- PyTorch implementation of SSQL (Accepted to ECCV2022 oral presentation)☆75Updated last year
- Efficient 2:4 sparse training algorithms and implementations☆50Updated 3 months ago
- Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.☆127Updated 2 years ago
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores☆304Updated 2 months ago
- [ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆93Updated last year
- QuEST: Efficient Finetuning for Low-bit Diffusion Models☆41Updated last month
- A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag…☆174Updated last month
- ☆149Updated 2 months ago
- ☆202Updated 3 years ago