bytedance / XVerse
[NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".
★600 · Updated last week
Alternatives and similar repositories for XVerse
Users who are interested in XVerse are comparing it to the repositories listed below.
- [CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition ★767 · Updated last month
- Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing" ★399 · Updated 2 months ago
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer ★797 · Updated 5 months ago
- The official implementation of RealisDance ★599 · Updated 3 months ago
- Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework ★493 · Updated 2 weeks ago
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) ★726 · Updated 4 months ago
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥 ★605 · Updated 3 months ago
- ★263 · Updated 2 months ago
- [TIP 2025] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation ★195 · Updated 2 weeks ago
- [AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation ★165 · Updated 3 months ago
- ★412 · Updated 6 months ago
- Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens. ★259 · Updated last month
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation ★264 · Updated 2 months ago
- We achieve high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi… ★311 · Updated last month
- 🔥 [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUX ★273 · Updated 2 months ago
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data." ★404 · Updated 4 months ago
- Calligrapher: Freestyle Text Image Customization ★291 · Updated last month
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset ★265 · Updated 3 months ago
- [Official] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off ★317 · Updated last month
- Pusa: Thousands Timesteps Video Diffusion Model ★649 · Updated last month
- Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching ★252 · Updated last month
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers ★467 · Updated last month
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios" ★334 · Updated last month
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning ★636 · Updated 2 weeks ago
- Mobius: Text to Seamless Looping Video Generation via Latent Shift ★163 · Updated 5 months ago
- Official ComfyUI repository of HelloMeme ★370 · Updated 3 months ago
- [CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation ★1,202 · Updated 2 months ago
- ★93 · Updated 3 months ago
- ★269 · Updated 3 weeks ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models ★178 · Updated 2 months ago