NVlabs / HALP
☆70Updated 3 years ago
Alternatives and similar repositories for HALP
Users that are interested in HALP are comparing it to the libraries listed below
- Offline Quantization Tools for Deploy.☆141Updated last year
- This project aims to explore the deployment of Swin-Transformer based on TensorRT, including the test results of FP16 and INT8.☆174Updated 3 years ago
- Make RepVGG Greater Again: A Quantization-aware Approach☆28Updated last year
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆72Updated 3 years ago
- ☆36Updated 2 years ago
- ☆35Updated 2 years ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆37Updated 4 years ago
- [CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric☆60Updated 2 years ago
- CUda Matrix Multiply library.☆83Updated 6 months ago
- ☆44Updated 4 years ago
- A set of examples around MegEngine☆31Updated 2 years ago
- This repository describes how to add a custom TensorRT plugin in C++ and Python☆29Updated 4 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆35Updated 3 years ago
- A tool to convert a TensorRT engine/plan to a fake ONNX model☆41Updated 3 years ago
- DeltaCNN End-to-End CNN Inference of Sparse Frame Differences in Videos☆59Updated 2 years ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆224Updated last year
- Count number of parameters / MACs / FLOPS for ONNX models.☆95Updated last year
- ☆61Updated last year
- Efficient GPU kernels for mixed-precision Vision Transformers in Triton☆15Updated 3 months ago
- Useful TensorRT plugins for PyTorch and MMDetection model conversion.☆165Updated last year
- Deep Learning tools and applications for NVIDIA AGX platforms.☆256Updated 4 months ago
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆81Updated last year
- [ECCV 2024] Isomorphic Pruning for Vision Models☆78Updated last year
- A simple tool that can generate TensorRT plugin code quickly.☆238Updated 2 years ago
- TVMScript kernel for deformable attention☆25Updated 4 years ago
- Tensorrt-Deformable-Detr☆63Updated 2 years ago
- Pytorch implementation of RAPQ, IJCAI 2022☆23Updated 2 years ago
- DCNv2 plugin implemented on TensorRT 7☆48Updated 3 years ago
- Repository for submodules containing code for MMAR 2023 "Detection-segmentation convolutional neural network for autonomous vehicle perc…☆29Updated 2 years ago
- A neural network training interface based on PyTorch, with a focus on flexibility☆63Updated last year