bytedance / XVerseLinks
[NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".
β615Updated last month
Alternatives and similar repositories for XVerse
Users that are interested in XVerse are comparing it to the libraries listed below
Sorting:
- Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"β416Updated 2 weeks ago
- [CVPR 2025 Highlightπ₯] Identity-Preserving Text-to-Video Generation by Frequency Decompositionβ786Updated 3 months ago
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformerβ817Updated 7 months ago
- Calligrapher: Freestyle Text Image Customizationβ294Updated 3 months ago
- Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.β269Updated 3 weeks ago
- β278Updated 4 months ago
- Rethinking High-Quality Aesthetic Poster Generation in a Unified Frameworkβ506Updated 2 months ago
- [TIP 2025] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generationβ196Updated 2 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Datasetβ269Updated 6 months ago
- The official implementation of RealisDanceβ608Updated 5 months ago
- [AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generationβ166Updated 5 months ago
- π₯ [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUXβ280Updated 4 months ago
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."β415Updated 6 months ago
- Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Cachingβ271Updated 3 months ago
- We achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additiβ¦β318Updated 3 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglementβ279Updated 2 months ago
- β416Updated 9 months ago
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement π₯β615Updated 5 months ago
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.β687Updated 3 months ago
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) and "UltraViCo: Bβ¦β757Updated this week
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Datasetβ509Updated last month
- Pusa: Thousands Timesteps Video Diffusion Modelβ666Updated 3 months ago
- Towards Real-Time Diffusion-Based Streaming Video Super-Resolution β An efficient one-step diffusion framework for streaming VSR with locβ¦β1,048Updated 2 weeks ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformersβ485Updated 3 months ago
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"β342Updated last month
- β95Updated last month
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animationβ314Updated 6 months ago
- [SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Offβ323Updated last month
- Official comfyui repository of Hellomemeβ372Updated 5 months ago
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulationβ614Updated 2 weeks ago