leimao / PyTorch-Eager-Mode-Quantization-TensorRT-Acceleration
TensorRT Acceleration for PyTorch Native Eager Mode Quantization Models
☆15 · Updated 10 months ago
Alternatives and similar repositories for PyTorch-Eager-Mode-Quantization-TensorRT-Acceleration
Users who are interested in PyTorch-Eager-Mode-Quantization-TensorRT-Acceleration compare it to the libraries listed below.
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch. ☆41 · Updated 3 months ago
- PyTorch Pruning Example ☆50 · Updated 2 years ago
- Converting weights of PyTorch models to ONNX & TensorRT engines ☆49 · Updated 2 years ago
- ☆31 · Updated 11 months ago
- Nsight Systems In Docker ☆20 · Updated last year
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more! ☆51 · Updated last week
- EfficientViT is a new family of vision models for efficient high-resolution vision. ☆26 · Updated last year
- A tool to convert a TensorRT engine/plan to a fake ONNX ☆39 · Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch ☆17 · Updated last year
- Context Manager to profile the forward and backward times of PyTorch's nn.Module ☆83 · Updated last year
- Patch convolution to avoid large GPU memory usage of Conv2D ☆87 · Updated 4 months ago
- Timm model explorer ☆39 · Updated last year
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning). ☆17 · Updated last year
- ViT inference in Triton because, why not? ☆28 · Updated last year
- Easily benchmark PyTorch model FLOPs, latency, throughput, allocated GPU memory and energy consumption ☆103 · Updated last year
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor… ☆68 · Updated this week
- DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos ☆59 · Updated 2 years ago
- Model compression for ONNX ☆96 · Updated 6 months ago
- Official implementation of "Active Image Indexing" ☆59 · Updated 2 years ago
- ☆40 · Updated last year
- Little article showing how to load PyTorch models with linear memory consumption ☆34 · Updated 2 years ago
- A block-oriented training approach for inference-time optimization. ☆33 · Updated 9 months ago
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios! ☆25 · Updated 4 months ago
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance. ☆127 · Updated this week
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, … ☆69 · Updated 3 weeks ago
- The Triton backend for TensorRT. ☆76 · Updated 3 weeks ago
- Awesome code, projects, books, etc. related to CUDA ☆17 · Updated last month
- ☆49 · Updated last year
- Just some miscellaneous utility functions / decorators / modules related to PyTorch and Accelerate to help speed up implementation of new… ☆120 · Updated 10 months ago
- ☆31 · Updated 2 years ago