kamalkraj / stable-diffusion-tritonserver
Deploy stable diffusion model with ONNX/TensorRT + tritonserver
☆123 Updated last year
Alternatives and similar repositories for stable-diffusion-tritonserver:
Users who are interested in stable-diffusion-tritonserver are comparing it to the repositories listed below.
- ☆53 Updated 2 years ago
- The Triton backend for TensorRT. ☆70 Updated last week
- Faster generation with text-to-image diffusion models. ☆211 Updated 5 months ago
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del… ☆26 Updated last year
- stable diffusion, controlnet, tensorrt, accelerate ☆56 Updated last year
- TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators. ☆18 Updated last year
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs. ☆198 Updated 2 months ago
- Context parallel attention that accelerates DiT model inference with dynamic caching ☆222 Updated this week
- ☆31 Updated 2 years ago
- The Triton backend for the ONNX Runtime. ☆140 Updated last week
- ☆99 Updated last year
- ☆113 Updated 2 years ago
- Generate long weighted prompt embeddings for Stable Diffusion ☆110 Updated 6 months ago
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU) ☆213 Updated last year
- Common source, scripts and utilities shared across all Triton repositories. ☆69 Updated this week
- Diffusers training with mmengine ☆99 Updated last year
- ☆238 Updated this week
- ONNX-Powered Inference for State-of-the-Art Face Upscalers ☆93 Updated 7 months ago
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization. ☆61 Updated last year
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training). ☆334 Updated last month
- Inference speed-up for stable-diffusion (ldm) with TensorRT. ☆35 Updated last year
- Model Compression Toolbox for Large Language Models and Diffusion Models ☆388 Updated last month
- An efficient implementation of Stable-Diffusion-XL ☆46 Updated last year
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training) ☆133 Updated 2 months ago
- Iterable datapipelines for pytorch training. ☆81 Updated 6 months ago
- Simple example of FastAPI + gRPC AsyncIO + Triton ☆63 Updated 2 years ago
- Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord community… ☆556 Updated last year
- Simple large-scale training of stable diffusion with multi-node support. ☆129 Updated last year
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising ☆191 Updated last month
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast… ☆254 Updated 5 months ago