ultralytics / thopLinks
Profile PyTorch models for FLOPs and parameters, helping to evaluate computational efficiency and memory usage.
☆108Updated last week
Alternatives and similar repositories for thop
Users that are interested in thop are comparing it to the libraries listed below
Sorting:
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆428Updated this week
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆81Updated last year
- Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption☆109Updated 2 years ago
- When it comes to optimizers, it's always better to be safe than sorry☆397Updated 3 months ago
- ☆193Updated 7 months ago
- Implementation for the paper 'YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs'☆103Updated 2 years ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆129Updated 8 months ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆311Updated last year
- Recent Advances on Efficient Vision Transformers☆55Updated 3 years ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆78Updated last year
- Export utility for unconstrained channel pruned models☆72Updated 2 years ago
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆75Updated 2 years ago
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆86Updated 8 months ago
- A library for calculating the FLOPs in the forward() process based on torch.fx☆134Updated 2 weeks ago
- A Toolkit to Help Optimize Onnx Model☆300Updated this week
- A Toolkit to Help Optimize Large Onnx Model☆162Updated 2 months ago
- Timm model explorer☆42Updated last year
- High Performance Int8 GEMM Kernels for SM80 and later GPUs.☆18Updated 10 months ago
- Get down and dirty with FlashAttention2.0 in pytorch, plug in and play no complex CUDA kernels☆113Updated 2 years ago
- Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".☆129Updated 2 years ago
- ☆168Updated 2 years ago
- Zero-label image classification via OpenCLIP knowledge distillation☆141Updated 2 years ago
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆98Updated last year
- Notes on quantization in neural networks☆114Updated 2 years ago
- Ultralytics YOLO with Additional Knowledge Distillation Capability☆81Updated 11 months ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆50Updated 2 years ago
- Awesome Pruning. ✅ Curated Resources for Neural Network Pruning.☆172Updated last year
- Ultralytics LLM-related experiments☆52Updated 2 weeks ago
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆85Updated this week
- 🔨🔨🔨(mmplot)used to draw graphs of multiple index parameters such as algorithm accuracy and speed of multiple deep learning models.☆86Updated last year