NVlabs / HALP
☆69 · Updated 3 years ago
Alternatives and similar repositories for HALP
Users who are interested in HALP are comparing it to the libraries listed below
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan … ☆72 · Updated 3 years ago
- This project aims to explore the deployment of Swin-Transformer based on TensorRT, including the test results of FP16 and INT8. ☆171 · Updated 2 years ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications. ☆218 · Updated last year
- Deep Learning tools and applications for NVIDIA AGX platforms. ☆246 · Updated 2 months ago
- This repository describes how to add a custom TensorRT plugin in C++ and Python ☆27 · Updated 4 years ago
- A tool to convert a TensorRT engine/plan to a fake ONNX ☆41 · Updated 2 years ago
- Offline quantization tools for deployment. ☆140 · Updated last year
- CUda Matrix Multiply library. ☆81 · Updated 4 months ago
- Make RepVGG Greater Again: A Quantization-aware Approach ☆28 · Updated last year
- A neural network training interface based on PyTorch, with a focus on flexibility ☆63 · Updated last year
- A set of examples around MegEngine ☆31 · Updated last year
- ☆36 · Updated 2 years ago
- FakeQuantize with Learned Step Size (LSQ+) as Observer in PyTorch ☆36 · Updated 3 years ago
- Collect the Performance of 3D Object Detection Methods from Multi-View Camera Images (BEV Perception). ☆35 · Updated 2 years ago
- ☆44 · Updated 4 years ago
- PyTorch Quantization Aware Training Example ☆143 · Updated last year
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan … ☆35 · Updated 3 years ago
- DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos ☆59 · Updated 2 years ago
- Efficient GPU kernels for mixed-precision Vision Transformers in Triton ☆15 · Updated last month
- A simple tool that can generate TensorRT plugin code quickly. ☆236 · Updated 2 years ago
- [ECCV 2022] DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection. ☆77 · Updated last year
- ☆35 · Updated 2 years ago
- ☆61 · Updated last year
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory ☆72 · Updated last year
- Count number of parameters / MACs / FLOPS for ONNX models. ☆94 · Updated last year
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything ☆81 · Updated last year
- Useful TensorRT plugins for PyTorch and MMDetection model conversion. ☆165 · Updated last year
- [CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric ☆58 · Updated 2 years ago
- ☆76 · Updated 3 years ago
- PyTorch implementation of RAPQ, IJCAI 2022 ☆22 · Updated 2 years ago