ultralytics / thop
Profile PyTorch models for FLOPs and parameters, helping to evaluate computational efficiency and memory usage.
☆17Updated last week
Related projects ⓘ
Alternatives and complementary repositories for thop
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆324Updated this week
- Collection of SOTA efficient computer vision models for embedded applications, with pre-trained weights and training recipes☆82Updated 3 weeks ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆81Updated last week
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆126Updated 2 weeks ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆179Updated 5 months ago
- Edge AI Model Development Tools☆33Updated 3 weeks ago
- Low Precision(quantized) Yolov5☆31Updated 9 months ago
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆55Updated 4 months ago
- ☆121Updated last year
- Timm model explorer☆36Updated 7 months ago
- Implementation for the paper 'YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs'☆94Updated last year
- TAO Toolkit deep learning networks with PyTorch backend☆86Updated last week
- DeltaCNN End-to-End CNN Inference of Sparse Frame Differences in Videos☆60Updated last year
- Scailable ONNX python tools☆96Updated 2 weeks ago
- Quick start scripts and tutorial notebooks to get started with TAO Toolkit☆45Updated 2 months ago
- Count number of parameters / MACs / FLOPS for ONNX models.☆88Updated 2 weeks ago
- ☆56Updated 2 years ago
- A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson☆152Updated 11 months ago
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆277Updated 6 months ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆292Updated last month
- ☆82Updated last month
- TFLite model analyzer & memory optimizer☆120Updated 9 months ago
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆63Updated last year
- ☆192Updated 3 years ago
- The no-code AI toolchain☆74Updated 3 weeks ago
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆123Updated last week
- ☆27Updated last year
- We have implemented a framework that supports developers to structured prune neural networks of Tensorflow Models☆27Updated last week
- Jetson embedded platform-target deep learning inference acceleration framework with TensorRT☆24Updated last week