NVIDIA / TensorRT-RTXLinks
NVIDIA TensorRT-RTX is an SDK for high-performance AI inference on NVIDIA RTX GPUs. This repository contains Open-Source Software components of TensorRT-RTX.
☆69Updated 3 weeks ago
Alternatives and similar repositories for TensorRT-RTX
Users that are interested in TensorRT-RTX are comparing it to the libraries listed below
Sorting:
- HunyuanDiT with TensorRT and libtorch☆18Updated last year
- A Toolkit to Help Optimize Onnx Model☆276Updated this week
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆54Updated last month
- C++ pipeline with OpenVINO native API for Stable Diffusion v1.5☆13Updated last year
- Memory Management for the GPU Poor, run the latest open source frontier models on consumer Nvidia GPUs☆161Updated 3 weeks ago
- Model compression for ONNX☆99Updated last year
- ☆187Updated last week
- DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom framewor…☆71Updated 3 months ago
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching☆403Updated 5 months ago
- Model Compression Toolbox for Large Language Models and Diffusion Models☆710Updated 4 months ago
- faster parallel inference of mochi-1 video generation model☆126Updated 9 months ago
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆75Updated last year
- A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface☆127Updated 2 weeks ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆388Updated 6 months ago
- stable diffusion, controlnet, tensorrt, accelerate☆58Updated 2 years ago
- The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PC…☆180Updated 3 weeks ago
- [NeurIPS'25] One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution☆320Updated 3 weeks ago
- A Toolkit to Help Optimize Large Onnx Model☆162Updated last month
- Fast and memory-efficient exact attention☆24Updated last year
- Qwen-Image-Lightning: Speed up Qwen-Image model with distillation☆1,044Updated 2 weeks ago
- A Gradio web UI for Depth-Pro, Sharp Monocular Metric Depth Estimation☆54Updated last year
- DFloat11 [NeurIPS '25]: Lossless Compression of LLMs and DiTs for Efficient GPU Inference☆572Updated 3 weeks ago
- ☆282Updated 11 months ago
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆50Updated 2 years ago
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year
- cache-dit for comfyui☆25Updated 2 months ago
- Stable Diffusion in TensorRT 8.5+☆15Updated 2 years ago
- External project in GitHub for marketing purposes. This repo will be used for code samples that accompany blog posts on https://stability…☆13Updated 7 months ago
- This repository provides optical character detection and recognition solution optimized on Nvidia devices.☆83Updated 7 months ago
- ☆194Updated 6 months ago