rajeevsrao / TensorRT
TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.
☆19Updated last year
Alternatives and similar repositories for TensorRT:
Users that are interested in TensorRT are comparing it to the libraries listed below
- stable diffusion, controlnet, tensorrt, accelerate☆56Updated last year
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching☆255Updated 3 weeks ago
- Faster generation with text-to-image diffusion models.☆213Updated 6 months ago
- ☆99Updated last year
- Generate long weighted prompt embeddings for Stable Diffusion☆113Updated this week
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆197Updated 2 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆342Updated 2 months ago
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆263Updated 6 months ago
- Official Repository of the paper "Trajectory Consistency Distillation"☆336Updated 11 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆210Updated 3 months ago
- An efficient implementation of Stable-Diffusion-XL☆46Updated last year
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆175Updated last year
- Put Your Face Everywhere in Seconds.☆312Updated last year
- ☆427Updated last year
- Experimental usage of stable-fast and TensorRT.☆208Updated 8 months ago
- An initiative to replicate Sora☆104Updated last year
- Accelerates Flux.1 image generation, just by using this node.☆129Updated 4 months ago
- implementation of the IPAdapter models for HF Diffusers☆172Updated last year
- ☆55Updated last year
- Optimum version of a UI for Stable Diffusion, running on ONNX models for faster inference, working on most common GPU vendors: NVIDIA,AMD…☆24Updated last year
- Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning☆28Updated last year
- A diffusers based implementation of HyperDreamBooth☆132Updated last year
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆465Updated 4 months ago
- ☆106Updated last week
- Deploy stable diffusion model with onnx/tenorrt + tritonserver☆123Updated last year
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆394Updated 2 months ago
- Model Compression Toolbox for Large Language Models and Diffusion Models☆435Updated 3 weeks ago
- IP Adapter Instruct☆204Updated 8 months ago
- ONNX-Powered Inference for State-of-the-Art Face Upscalers☆97Updated 9 months ago
- ☆315Updated last year