bytedance / XVerseLinks
[NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".
β606Updated last week
Alternatives and similar repositories for XVerse
Users that are interested in XVerse are comparing it to the libraries listed below
Sorting:
- Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"β406Updated 2 weeks ago
- [CVPR 2025 Highlightπ₯] Identity-Preserving Text-to-Video Generation by Frequency Decompositionβ775Updated last month
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformerβ805Updated 6 months ago
- Rethinking High-Quality Aesthetic Poster Generation in a Unified Frameworkβ496Updated last month
- [TIP 2025] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generationβ195Updated last month
- [AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generationβ166Updated 3 months ago
- β269Updated 3 months ago
- Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Cachingβ256Updated 2 months ago
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement π₯β610Updated 4 months ago
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."β405Updated 4 months ago
- β412Updated 7 months ago
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)β732Updated 5 months ago
- π₯ [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUXβ281Updated 3 months ago
- The official implementation of RealisDanceβ605Updated 4 months ago
- Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.β264Updated 2 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglementβ267Updated 2 weeks ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Datasetβ177Updated last week
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.β658Updated last month
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Datasetβ267Updated 4 months ago
- Calligrapher: Freestyle Text Image Customizationβ293Updated last month
- Codes for ID-Specific Video Customized Diffusionβ457Updated last year
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioningβ753Updated last week
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"β338Updated 2 months ago
- Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024)β256Updated 6 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformersβ473Updated 2 months ago
- Official comfyui repository of Hellomemeβ370Updated 4 months ago
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generationβ297Updated last year
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animationβ307Updated 4 months ago
- Pusa: Thousands Timesteps Video Diffusion Modelβ659Updated last month
- We achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additiβ¦β317Updated 2 months ago