NVlabs / HALP
☆66 Updated 2 years ago
Alternatives and similar repositories for HALP:
Users who are interested in HALP are comparing it to the repositories listed below.
- This project aims to explore the deployment of Swin-Transformer based on TensorRT, including the test results of FP16 and INT8. ☆164 Updated 2 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan … ☆71 Updated 2 years ago
- Offline Quantization Tools for Deploy. ☆125 Updated last year
- A set of examples around MegEngine ☆31 Updated last year
- ☆33 Updated last year
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything ☆65 Updated 8 months ago
- [CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric ☆52 Updated last year
- ☆34 Updated last year
- This repository has been moved. The new location is https://github.com/TexasInstruments/edgeai-tensorlab ☆71 Updated 11 months ago
- ☆44 Updated 3 years ago
- DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos ☆60 Updated 2 years ago
- Make RepVGG Greater Again: A Quantization-aware Approach ☆21 Updated last year
- The official implementation of the NeurIPS 2022 paper Q-ViT. ☆87 Updated last year
- ☆75 Updated 2 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan … ☆35 Updated 2 years ago
- CUda Matrix Multiply library. ☆75 Updated 2 weeks ago
- Tensorrt-Deformable-Detr ☆59 Updated 2 years ago
- FakeQuantize with Learned Step Size (LSQ+) as Observer in PyTorch ☆33 Updated 3 years ago
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory ☆66 Updated last year
- A tool to convert a TensorRT engine/plan to a fake ONNX ☆38 Updated 2 years ago
- This repository describes how to add a custom TensorRT plugin in C++ and Python ☆28 Updated 3 years ago
- A simple tool that can generate TensorRT plugin code quickly. ☆228 Updated last year
- Inference of quantization aware trained networks using TensorRT ☆80 Updated 2 years ago
- This is an 8-bit quantization sample for YOLOv5. PTQ, QAT, and Partial Quantization have all been implemented, and present the results based… ☆101 Updated 2 years ago
- [ECCV 2024] Isomorphic Pruning for Vision Models ☆66 Updated 7 months ago
- Post-Training Quantization for Vision Transformers. ☆208 Updated 2 years ago
- ☆59 Updated 8 months ago
- [ICLR 2024] LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection. ☆74 Updated 6 months ago
- ☆59 Updated 2 years ago
- Slides with modifications for a course at Tsinghua University. ☆59 Updated 2 years ago