bytedance / XVerseLinks
[NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".
β617Updated 2 months ago
Alternatives and similar repositories for XVerse
Users that are interested in XVerse are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 Highlightπ₯] Identity-Preserving Text-to-Video Generation by Frequency Decompositionβ797Updated 4 months ago
- Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"β428Updated last month
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformerβ829Updated 8 months ago
- [AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generationβ170Updated 6 months ago
- Rethinking High-Quality Aesthetic Poster Generation in a Unified Frameworkβ522Updated 3 months ago
- The official implementation of RealisDanceβ607Updated 7 months ago
- β280Updated 5 months ago
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement π₯β620Updated last month
- [TIP 2025] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generationβ195Updated 4 months ago
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) and "UltraViCo: Bβ¦β774Updated last month
- Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.β276Updated 2 months ago
- β415Updated 10 months ago
- π₯ [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUXβ281Updated 5 months ago
- Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Cachingβ278Updated 4 months ago
- AnyTalker: Scaling Multi-person Talking Video Generation with Interactivity Refinementβ252Updated last month
- [NeurIPS 2025 D&Bπ₯] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generationβ190Updated 2 weeks ago
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."β425Updated 7 months ago
- Pusa: Thousands Timesteps Video Diffusion Modelβ671Updated 4 months ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Datasetβ556Updated 2 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformersβ493Updated 5 months ago
- Calligrapher: Freestyle Text Image Customizationβ295Updated 4 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglementβ282Updated last week
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generationβ298Updated last year
- Official code for StoryMem: Multi-shot Long Video Storytelling with Memoryβ607Updated 3 weeks ago
- Echo-4oβ474Updated last month
- We achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additiβ¦β322Updated 5 months ago
- Codes for ID-Specific Video Customized Diffusionβ462Updated last year
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Datasetβ269Updated 7 months ago
- Towards Real-Time Diffusion-Based Streaming Video Super-Resolution β An efficient one-step diffusion framework for streaming VSR with locβ¦β1,220Updated 3 weeks ago
- [CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generationβ1,226Updated 6 months ago