IDKiro / sdxs
Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
☆628 Updated last year
Alternatives and similar repositories for sdxs
Users that are interested in sdxs are comparing it to the libraries listed below
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥 ☆593 Updated last month
- [CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation ☆1,195 Updated last month
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models. ☆1,022 Updated 11 months ago
- [CVPR 2025 Highlight 🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition ☆741 Updated 2 months ago
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) ☆704 Updated 2 months ago
- ☆409 Updated 4 months ago
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer ☆755 Updated 3 months ago
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation ☆295 Updated last year
- Codes for ID-Specific Video Customized Diffusion ☆455 Updated last year
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models ☆915 Updated 4 months ago
- PyTorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024) ☆204 Updated last year
- The official implementation of RealisDance ☆588 Updated last month
- Customized ID Consistent for human ☆970 Updated 5 months ago
- Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation". ☆555 Updated 2 weeks ago
- ☆153 Updated last year
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion" ☆1,643 Updated 7 months ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te… ☆1,089 Updated 6 months ago
- [ICML 2023 Oral, NeurIPS 2023] Official implementations for paper: Customizable Image Synthesis with Multiple Subjects ☆444 Updated last year
- Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing" ☆357 Updated 2 weeks ago
- Video generation from text&image, 1st-gen ☆925 Updated 2 months ago
- Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, C… ☆300 Updated 10 months ago
- Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework ☆482 Updated last month
- ☆899 Updated 7 months ago
- Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation". ☆1,156 Updated 3 months ago
- [AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation ☆161 Updated last month
- [TIP 2025] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation ☆191 Updated 3 months ago
- Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024) ☆255 Updated 3 months ago
- 🔥 [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUX ☆246 Updated 2 weeks ago
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation ☆551 Updated 10 months ago
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code! ☆826 Updated 8 months ago