kamalkraj / stable-diffusion-tritonserver
Deploy a Stable Diffusion model with ONNX/TensorRT + tritonserver
☆123 · Updated last year
Related projects
Alternatives and complementary repositories for stable-diffusion-tritonserver
- ☆52 · Updated last year
- Faster generation with text-to-image diffusion models. ☆196 · Updated last month
- stable diffusion, controlnet, tensorrt, accelerate ☆54 · Updated last year
- The Triton backend for TensorRT. ☆64 · Updated this week
- Quantized stable-diffusion cutting down memory 75%, testing in streamlit, deploying in container ☆54 · Updated last week
- ☆102 · Updated last year
- This is a Gradio WebUI working with the Diffusers format of Stable Diffusion ☆79 · Updated last year
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del… ☆25 · Updated last year
- ☆30 · Updated 2 years ago
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU) ☆203 · Updated last year
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily. ☆150 · Updated 2 months ago
- Simple large-scale training of stable diffusion with multi-node support. ☆126 · Updated last year
- Iterable datapipelines for pytorch training. ☆81 · Updated 2 months ago
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL. ☆166 · Updated 7 months ago
- Instruct-tune LLaMA on consumer hardware ☆73 · Updated last year
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising ☆165 · Updated last month
- what I learned about fine-tuning stable diffusion ☆135 · Updated last year
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training). ☆268 · Updated 2 weeks ago
- Fine-tuning of diffusion models ☆97 · Updated last year
- Generate long weighted prompt embeddings for Stable Diffusion ☆84 · Updated 2 months ago
- ☆193 · Updated this week
- Model Compression Toolbox for Large Language Models and Diffusion Models ☆233 · Updated 2 weeks ago
- An efficient implementation of Stable-Diffusion-XL ☆45 · Updated last year
- This repository contains tutorials and examples for Triton Inference Server ☆568 · Updated this week
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆153 · Updated last year
- Int8 StableFusion model ☆41 · Updated 2 years ago
- [ICCV 2023] Q-Diffusion: Quantizing Diffusion Models. ☆331 · Updated 8 months ago
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs. ☆185 · Updated 2 months ago
- Repo for my Tensorflow/Keras CV experiments. Mostly revolving around the Danbooru20xx dataset ☆116 · Updated last year
- ☆96 · Updated last year