kamalkraj / stable-diffusion-tritonserverLinks
Deploy stable diffusion model with onnx/tenorrt + tritonserver
☆126Updated 2 years ago
Alternatives and similar repositories for stable-diffusion-tritonserver
Users that are interested in stable-diffusion-tritonserver are comparing it to the libraries listed below
Sorting:
- ☆53Updated 2 years ago
- Faster generation with text-to-image diffusion models.☆226Updated 2 months ago
- The Triton backend for TensorRT.☆78Updated last week
- Making Flux go brrr on GPUs.☆138Updated last month
- ☆101Updated last year
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆223Updated 2 years ago
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆177Updated last year
- Generate long weighted prompt embeddings for Stable Diffusion☆135Updated 4 months ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆156Updated last year
- stable diffusion, controlnet, tensorrt, accelerate☆58Updated 2 years ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆378Updated 3 months ago
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Updated 2 years ago
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆276Updated 11 months ago
- what I learned about fine-tuning stable diffusion☆147Updated 2 years ago
- Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord communty…☆560Updated last year
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.☆65Updated last year
- Quantized stable-diffusion cutting down memory 75%, testing in streamlit, deploying in container☆54Updated 3 weeks ago
- ☆435Updated last year
- Segmind Distilled diffusion☆610Updated last year
- Diffusion WebUI: Stable Diffusion + ControlNet + Inpaint☆53Updated 2 years ago
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆235Updated 2 years ago
- Code for instruction-tuning Stable Diffusion.☆239Updated last year
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆141Updated 7 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆426Updated 2 years ago
- ☆55Updated last year
- Writing FLUX in Triton☆40Updated 11 months ago
- TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.☆19Updated last year
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆205Updated 6 months ago
- Fine-tuning of diffusion models☆99Updated 2 years ago
- Comparison of different stable diffusion implementations and optimizations☆39Updated last year