kamalkraj / stable-diffusion-tritonserver
Deploy stable diffusion model with onnx/tenorrt + tritonserver
☆122Updated last year
Alternatives and similar repositories for stable-diffusion-tritonserver:
Users that are interested in stable-diffusion-tritonserver are comparing it to the libraries listed below
- ☆52Updated last year
- stable diffusion, controlnet, tensorrt, accelerate☆55Updated last year
- Faster generation with text-to-image diffusion models.☆210Updated 4 months ago
- Diffusers training with mmengine☆99Updated last year
- The Triton backend for TensorRT.☆69Updated this week
- Context parallel attention that accelerates DiT model inference with dynamic caching☆189Updated this week
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆213Updated last year
- Iterable datapipelines for pytorch training.☆81Updated 5 months ago
- TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.☆17Updated 11 months ago
- Simple large-scale training of stable diffusion with multi-node support.☆128Updated last year
- Generate long weighted prompt embeddings for Stable Diffusion☆108Updated 4 months ago
- ☆109Updated 2 years ago
- Quantized stable-diffusion cutting down memory 75%, testing in streamlit, deploying in container☆54Updated this week
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆25Updated last year
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆179Updated 4 months ago
- Model Compression Toolbox for Large Language Models and Diffusion Models☆328Updated last week
- A Gradio component that can be used to annotate images with bounding boxes.☆43Updated 3 months ago
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆247Updated 4 months ago
- An efficient implementation of Stable-Diffusion-XL☆45Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆410Updated last year
- ONNX-Powered Inference for State-of-the-Art Face Upscalers☆89Updated 6 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆319Updated this week
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.☆60Updated 11 months ago
- ☆30Updated 2 years ago
- ☆98Updated last year
- Comparison of different stable diffusion implementations and optimizations☆38Updated last year
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.☆772Updated last week
- Common source, scripts and utilities shared across all Triton repositories.☆68Updated last week
- Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord communty…☆555Updated last year
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆169Updated 10 months ago