kamalkraj / stable-diffusion-tritonserver
Deploy stable diffusion model with onnx/tensorrt + tritonserver
☆125Updated 2 years ago
Alternatives and similar repositories for stable-diffusion-tritonserver
Users that are interested in stable-diffusion-tritonserver are comparing it to the libraries listed below
- ☆53Updated 2 years ago
- Faster generation with text-to-image diffusion models.☆225Updated last month
- Making Flux go brrr on GPUs.☆131Updated last month
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆177Updated last year
- stable diffusion, controlnet, tensorrt, accelerate☆58Updated 2 years ago
- Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord community…☆559Updated last year
- ☆101Updated last year
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆274Updated 10 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆375Updated 2 months ago
- Generate long weighted prompt embeddings for Stable Diffusion☆134Updated 4 months ago
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆220Updated 2 years ago
- what I learned about fine-tuning stable diffusion☆147Updated 2 years ago
- Quantized stable-diffusion cutting down memory 75%, testing in streamlit, deploying in container☆54Updated this week
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Updated 2 years ago
- The Triton backend for TensorRT.☆78Updated 3 weeks ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆155Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago
- TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.☆19Updated last year
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.☆65Updated last year
- ☆121Updated 2 years ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 9 months ago
- Segmind Distilled diffusion☆608Updated last year
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆235Updated last year
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆205Updated 6 months ago
- Diffusion WebUI: Stable Diffusion + ControlNet + Inpaint☆53Updated 2 years ago
- Iterable datapipelines for pytorch training.☆87Updated 11 months ago
- Writing FLUX in Triton☆40Updated 11 months ago
- ☆437Updated last year
- Code for instruction-tuning Stable Diffusion.☆238Updated last year
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual capability to any diffusion model without additional training)☆140Updated 7 months ago