leimao / PyTorch-Eager-Mode-Quantization-TensorRT-AccelerationLinks
TensorRT Acceleration for PyTorch Native Eager Mode Quantization Models
☆15Updated 11 months ago
Alternatives and similar repositories for PyTorch-Eager-Mode-Quantization-TensorRT-Acceleration
Users that are interested in PyTorch-Eager-Mode-Quantization-TensorRT-Acceleration are comparing it to the libraries listed below
Sorting:
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch.☆41Updated 4 months ago
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆51Updated last week
- ☆32Updated 2 weeks ago
- PyTorch Pruning Example☆50Updated 2 years ago
- Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption☆103Updated last year
- Nsight Systems In Docker☆20Updated last year
- Restorers provide out-of-the-box TensorFlow implementations of SoTA image and video restoration models for tasks such as low-light enhanc…☆38Updated last year
- Model compression for ONNX☆96Updated 7 months ago
- ☆35Updated 2 years ago
- DeltaCNN End-to-End CNN Inference of Sparse Frame Differences in Videos☆59Updated 2 years ago
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Updated last year
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!☆25Updated 5 months ago
- EfficientViT is a new family of vision models for efficient high-resolution vision.☆26Updated last year
- A tool convert TensorRT engine/plan to a fake onnx☆39Updated 2 years ago
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆70Updated last month
- Count number of parameters / MACs / FLOPS for ONNX models.☆93Updated 8 months ago
- A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface☆110Updated 3 months ago
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆71Updated this week
- Repo for event-based binary image reconstruction.☆33Updated last year
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…☆33Updated 3 years ago
- The Triton backend for TensorRT.☆77Updated last week
- A Toolkit to Help Optimize Onnx Model☆161Updated this week
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆75Updated last month
- Zero-label image classification via OpenCLIP knowledge distillation☆127Updated last year
- SandLogic Lexicons☆19Updated 8 months ago
- ContourFormer:Real-Time Contour-Based End-to-End Instance Segmentation Transformer☆16Updated last month
- Hacks for PyTorch☆19Updated 2 years ago
- This repository describes how to add a custom TensorRT plugin in c++ and python☆28Updated 4 years ago
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.☆167Updated this week
- Awesome code, projects, books, etc. related to CUDA☆17Updated last week