zugexiaodui / torch_flops
A library for calculating the FLOPs in the forward() process based on torch.fx
☆106 Updated last month
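torch_flops traces a model's forward() with torch.fx and attributes FLOPs to the nodes of the resulting graph. The snippet below is a minimal illustrative sketch of that idea (symbolic tracing plus shape propagation, counting only nn.Linear layers); it is not the torch_flops API, and the helper name `count_linear_flops` is made up for this example.

```python
# A minimal sketch (not the torch_flops API) of FLOP counting via torch.fx:
# symbolically trace forward(), propagate shapes, then sum FLOPs per node.
# Only nn.Linear layers are counted here; count_linear_flops is a hypothetical
# helper name used for illustration.
import math

import torch
import torch.nn as nn
from torch.fx import symbolic_trace
from torch.fx.passes.shape_prop import ShapeProp


def count_linear_flops(model: nn.Module, example_input: torch.Tensor) -> int:
    traced = symbolic_trace(model)               # fx graph of forward()
    ShapeProp(traced).propagate(example_input)   # annotate nodes with output shapes

    total = 0
    for node in traced.graph.nodes:
        if node.op == "call_module":
            mod = traced.get_submodule(node.target)
            if isinstance(mod, nn.Linear):
                out_shape = node.meta["tensor_meta"].shape
                rows = math.prod(out_shape[:-1])  # batch (and any other leading dims)
                # y = x @ W^T + b: roughly 2 * in_features FLOPs per output element
                total += 2 * rows * mod.in_features * mod.out_features
    return total


if __name__ == "__main__":
    net = nn.Sequential(nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, 10))
    print(count_linear_flops(net, torch.randn(1, 64)))  # 18944
```

torch_flops builds on the same torch.fx machinery but covers a much broader operator set; see the repository for its actual interface.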
Alternatives and similar repositories for torch_flops:
Users who are interested in torch_flops are comparing it to the libraries listed below.
- An efficient PyTorch implementation of selective scan in one file, works with both CPU and GPU, with corresponding mathematical derivatio… ☆86 Updated last year
- Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficien… ☆100 Updated last month
- ☆189 Updated last year
- XAttention: Block Sparse Attention with Antidiagonal Scoring ☆142 Updated last month
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer ☆68 Updated last year
- Implementation of Post-training Quantization on Diffusion Models (CVPR 2023) ☆137 Updated 2 years ago
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di… ☆59 Updated 11 months ago
- Awesome list of papers that extend Mamba to various applications. ☆132 Updated last month
- [CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers ☆48 Updated 8 months ago
- [ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation ☆80 Updated last month
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks". ☆248 Updated 2 years ago
- Curated list of methods that focus on improving the efficiency of diffusion models ☆44 Updated 10 months ago
- PyTorch code for our paper "ARB-LLM: Alternating Refined Binarizations for Large Language Models" ☆24 Updated last month
- [NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs. ☆158 Updated 7 months ago
- ☆163 Updated 3 months ago
- [ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers. ☆101 Updated 4 months ago
- [CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Mo… ☆63 Updated 9 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching ☆101 Updated 9 months ago
- A sparse attention kernel supporting mixed sparse patterns ☆200 Updated 2 months ago
- Causal depthwise conv1d in CUDA, with a PyTorch interface ☆443 Updated 5 months ago
- [NeurIPS 2023] Structural Pruning for Diffusion Models ☆189 Updated 10 months ago
- [ICML 2024 Oral] This project is the official implementation of our Accurate LoRA-Finetuning Quantization of LLMs via Information Retenti… ☆65 Updated last year
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio… ☆44 Updated 7 months ago
- Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models ☆284 Updated 2 months ago
- PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005 ☆28 Updated 6 months ago
- (Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from … ☆165 Updated 11 months ago
- Official implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS 2024 Oral) ☆22 Updated 3 months ago
- The official implementation of the NeurIPS 2022 paper Q-ViT. ☆88 Updated last year
- ☆41 Updated last year
- Patch convolution to avoid large GPU memory usage of Conv2D ☆86 Updated 3 months ago