IDKiro / sdxs
Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
☆619 Updated last year
Alternatives and similar repositories for sdxs
Users that are interested in sdxs are comparing it to the libraries listed below
- Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥 ☆574 Updated 5 months ago
- [CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation ☆1,154 Updated last week
- [CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition ☆702 Updated last week
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models. ☆1,009 Updated 9 months ago
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) ☆674 Updated 3 weeks ago
- Video generation from text & image, 1st-gen ☆923 Updated 3 weeks ago
- The official implementation of RealisDance ☆530 Updated last week
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models ☆911 Updated 2 months ago
- [ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation ☆340 Updated 2 weeks ago
- Codes for ID-Specific Video Customized Diffusion ☆452 Updated last year
- Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation". ☆1,134 Updated last month
- ☆894 Updated 5 months ago
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation ☆293 Updated 10 months ago
- ☆151 Updated last year
- Customized ID Consistent for human ☆961 Updated 3 months ago
- [ICML 2023 Oral, NeurIPS 2023] Official implementations for paper: Customizable Image Synthesis with Multiple Subjects ☆438 Updated last year
- Pytorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024) ☆205 Updated last year
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer ☆642 Updated last month
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation ☆1,129 Updated 6 months ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te… ☆1,080 Updated 4 months ago
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion" ☆1,605 Updated 5 months ago
- Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024) ☆252 Updated last month
- [NeurIPS 2024] Boosting the performance of consistency models with PCM! ☆475 Updated 5 months ago
- Liquid: Language Models are Scalable and Unified Multi-modal Generators ☆587 Updated last month
- 🔥 Official ComfyUI native node for InfiniteYou with FLUX ☆153 Updated 2 weeks ago
- Training-free Regional Prompting for Diffusion Transformers 🔥 ☆646 Updated 6 months ago
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code! ☆819 Updated 6 months ago
- ☆435 Updated last year
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation ☆542 Updated 8 months ago
- [ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation ☆492 Updated 11 months ago