IDKiro / sdxs
Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
☆635 Updated last year
Alternatives and similar repositories for sdxs
Users who are interested in sdxs are comparing it to the repositories listed below.
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥 ☆605 Updated 2 months ago
- [CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation ☆1,198 Updated 2 months ago
- [CVPR 2025 Highlight 🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition ☆762 Updated 3 weeks ago
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models. ☆1,024 Updated last year
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) ☆720 Updated 4 months ago
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer ☆789 Updated 4 months ago
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation ☆296 Updated last year
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models ☆914 Updated 6 months ago
- ☆412 Updated 6 months ago
- Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling ☆1,068 Updated 2 weeks ago
- Codes for ID-Specific Video Customized Diffusion ☆458 Updated last year
- Pytorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024) ☆206 Updated last year
- Video generation from text&image, 1st-gen ☆922 Updated 4 months ago
- Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing" ☆393 Updated last month
- The official implementation of RealisDance ☆597 Updated 3 months ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te… ☆1,095 Updated 7 months ago
- Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation". ☆593 Updated last month
- 🔥 [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUX ☆268 Updated last month
- Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, C… ☆305 Updated 11 months ago
- [AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation ☆165 Updated 2 months ago
- [ICML 2023 Oral, NeurIPS 2023] Official implementations for paper: Customizable Image Synthesis with Multiple Subjects ☆443 Updated 2 years ago
- Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework ☆495 Updated 2 months ago
- Customized ID Consistent for human ☆970 Updated 7 months ago
- ☆154 Updated last year
- Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation". ☆1,166 Updated 5 months ago
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion" ☆1,658 Updated 9 months ago
- ☆897 Updated 9 months ago
- Unified Multimodal Model for image generation/editing/understanding ☆775 Updated last week
- Liquid: Language Models are Scalable and Unified Multi-modal Generators ☆615 Updated 5 months ago
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation ☆1,132 Updated last week