ultralytics / thopLinks
Profile PyTorch models for FLOPs and parameters, helping to evaluate computational efficiency and memory usage.
☆47Updated 2 months ago
Alternatives and similar repositories for thop
Users that are interested in thop are comparing it to the libraries listed below
Sorting:
- This repository contains the training code of ParetoQ introduced in our work "ParetoQ Scaling Laws in Extremely Low-bit LLM Quantization"☆87Updated last month
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆92Updated last year
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆78Updated last year
- Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".☆122Updated 2 years ago
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆406Updated last week
- ☆152Updated 2 years ago
- ☆206Updated 3 years ago
- Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption☆103Updated last year
- Fast Hadamard transform in CUDA, with a PyTorch interface☆206Updated last year
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆344Updated 2 years ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆149Updated 3 weeks ago
- ☆22Updated last year
- A repository dedicated to evaluating the performance of quantizied LLaMA3 using various quantization methods..☆191Updated 6 months ago
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆72Updated this week
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆96Updated 2 years ago
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…☆63Updated last year
- Notes on quantization in neural networks☆90Updated last year
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆305Updated 10 months ago
- List of papers related to neural network quantization in recent AI conferences and journals.☆669Updated 3 months ago
- ☆263Updated 10 months ago
- ☆172Updated last year
- ☆237Updated 2 years ago
- Recent Advances on Efficient Vision Transformers☆51Updated 2 years ago
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs☆111Updated 2 weeks ago
- On-Device Training Under 256KB Memory [NeurIPS'22]☆483Updated last year
- Post-Training Quantization for Vision transformers.☆221Updated 3 years ago
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆69Updated last year
- Code repo for the paper "SpinQuant LLM quantization with learned rotations"☆302Updated 5 months ago
- ☆153Updated 2 years ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆46Updated last year