kamalkraj / stable-diffusion-tritonserver
Deploy stable diffusion model with onnx/tensorrt + tritonserver
☆119Updated last year
Related projects:
- ☆51Updated last year
- Faster generation with text-to-image diffusion models.☆181Updated 4 months ago
- ☆101Updated last year
- The Triton backend for TensorRT.☆59Updated last week
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆190Updated last year
- Quantized stable-diffusion cutting down memory 75%, testing in streamlit, deploying in container☆55Updated 2 weeks ago
- stable diffusion, controlnet, tensorrt, accelerate☆52Updated last year
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆24Updated last year
- ☆97Updated 11 months ago
- This is a Gradio WebUI working with the Diffusers format of Stable Diffusion☆78Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆122Updated last year
- Official implementation of "AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising"☆144Updated last month
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily.☆141Updated 3 weeks ago
- TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.☆15Updated 6 months ago
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆164Updated 5 months ago
- A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]☆243Updated 2 months ago
- Diffusers training with mmengine☆92Updated 7 months ago
- Iterable datapipelines for pytorch training.☆78Updated 3 weeks ago
- Common source, scripts and utilities shared across all Triton repositories.☆62Updated 2 weeks ago
- Code for instruction-tuning Stable Diffusion.☆190Updated 7 months ago
- Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord community…☆552Updated 9 months ago
- ☆29Updated 2 years ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training)☆124Updated 3 months ago
- An efficient implementation of Stable-Diffusion-XL☆45Updated 11 months ago
- ☆170Updated this week
- The Triton backend for the ONNX Runtime.☆122Updated this week
- Official Pytorch implementation of "Graphit: A Unified Framework for Diverse Image Editing Tasks"☆203Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆381Updated last year
- Diffusion Reinforcement Learning Library☆171Updated 7 months ago
- Easy and Efficient Quantization for Transformers☆172Updated 2 months ago