IDKiro / sdxsLinks
Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
β624Updated last year
Alternatives and similar repositories for sdxs
Users that are interested in sdxs are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement π₯β585Updated 3 weeks ago
- [CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generationβ1,188Updated last week
- [CVPR 2025 Highlightπ₯] Identity-Preserving Text-to-Video Generation by Frequency Decompositionβ728Updated last month
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.β1,020Updated 10 months ago
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generationβ295Updated last year
- Codes for ID-Specific Video Customized Diffusionβ454Updated last year
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)β696Updated 2 months ago
- The official implementation of RealisDanceβ565Updated 3 weeks ago
- Pytorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024)β204Updated last year
- β405Updated 4 months ago
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformerβ722Updated 2 months ago
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"β1,628Updated 7 months ago
- β153Updated last year
- Customized ID Consistent for humanβ969Updated 4 months ago
- Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".β523Updated last week
- Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diο¬usion Models for Consistent Human Image Animation".β1,151Updated 3 months ago
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Modelsβ916Updated 4 months ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple teβ¦β1,087Updated 5 months ago
- Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Cβ¦β298Updated 9 months ago
- [AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generationβ159Updated 2 weeks ago
- [ICML 2023 Oral, NeurIPS 2023] Official implementations for paper: Customizable Image Synthesis with Multiple Subjectsβ442Updated last year
- Video generation from text&image, 1st-genβ925Updated 2 months ago
- π₯ [ICCV 2025] Official ComfyUI native node for InfiniteYou with FLUXβ206Updated 2 weeks ago
- β897Updated 7 months ago
- Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"β277Updated this week
- Rethinking High-Quality Aesthetic Poster Generation in a Unified Frameworkβ466Updated 2 weeks ago
- Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024)β255Updated 2 months ago
- [Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generationβ191Updated 3 months ago
- Liquid: Language Models are Scalable and Unified Multi-modal Generatorsβ603Updated 3 months ago
- [ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generationβ351Updated 2 weeks ago