kamalkraj / stable-diffusion-tritonserverLinks
Deploy stable diffusion model with onnx/tenorrt + tritonserver
☆123Updated last year
Alternatives and similar repositories for stable-diffusion-tritonserver
Users that are interested in stable-diffusion-tritonserver are comparing it to the libraries listed below
Sorting:
- The Triton backend for TensorRT.☆76Updated 3 weeks ago
- ☆53Updated 2 years ago
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆175Updated last year
- stable diffusion, controlnet, tensorrt, accelerate☆56Updated 2 years ago
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Updated last year
- An efficient implementation of Stable-Diffusion-XL☆47Updated last year
- ☆99Updated last year
- Faster generation with text-to-image diffusion models.☆214Updated 7 months ago
- Common source, scripts and utilities shared across all Triton repositories.☆72Updated 3 weeks ago
- ONNX-Powered Inference for State-of-the-Art Face Upscalers☆97Updated 10 months ago
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆219Updated last year
- A Toolkit to Help Optimize Onnx Model☆153Updated this week
- The Triton backend for the ONNX Runtime.☆148Updated 3 weeks ago
- Iterable datapipelines for pytorch training.☆83Updated 9 months ago
- ☆31Updated 2 years ago
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆266Updated 7 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆358Updated last week
- TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.☆19Updated last year
- This is a Gradio WebUI working with the Diffusers format of Stable Diffusion☆80Updated 2 years ago
- Generate long weighted prompt embeddings for Stable Diffusion☆120Updated last month
- Diffusers training with mmengine☆100Updated last year
- Writing FLUX in Triton☆33Updated 8 months ago
- Official Pytorch implementation of "Graphit: A Unified Framework for Diverse Image Editing Tasks"☆201Updated 2 years ago
- ☆97Updated last month
- Quantized stable-diffusion cutting down memory 75%, testing in streamlit, deploying in container☆53Updated last week
- This is the onnxruntime inference code for GFP-GAN: Towards Real-World Blind Face Restoration with Generative Facial Prior (CVPR 2021). …☆144Updated 2 years ago
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago
- A Gradio component that can be used to annotate images with bounding boxes.☆52Updated 3 months ago
- Torchserve + TensorRT + Detection☆19Updated 3 years ago
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆133Updated 2 weeks ago