bytedance / XVerse
[NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".
★613 · Updated 3 weeks ago
Alternatives and similar repositories for XVerse
Users who are interested in XVerse are comparing it to the repositories listed below.
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer · ★810 · Updated 6 months ago
- [CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition · ★782 · Updated 2 months ago
- Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing" · ★415 · Updated 3 weeks ago
- The official implementation of RealisDance · ★605 · Updated 5 months ago
- Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework · ★496 · Updated last month
- [TIP 2025] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation · ★196 · Updated last month
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥 · ★615 · Updated 4 months ago
- Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens. · ★265 · Updated last week
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) · ★736 · Updated 6 months ago
- ★275 · Updated 3 months ago
- [AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation · ★166 · Updated 4 months ago
- Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching · ★268 · Updated 2 months ago
- 🔥 [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUX · ★281 · Updated 3 months ago
- ★413 · Updated 8 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement · ★272 · Updated last month
- Calligrapher: Freestyle Text Image Customization · ★294 · Updated 2 months ago
- Towards Real-Time Diffusion-Based Streaming Video Super-Resolution - An efficient one-step diffusion framework for streaming VSR with loc… · ★806 · Updated 2 weeks ago
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation. · ★663 · Updated 2 months ago
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios" · ★339 · Updated 3 weeks ago
- We achieve high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi… · ★319 · Updated 3 months ago
- Pusa: Thousands Timesteps Video Diffusion Model · ★661 · Updated 2 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset · ★268 · Updated 5 months ago
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation · ★296 · Updated last year
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data." · ★410 · Updated 5 months ago
- Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024) · ★256 · Updated 6 months ago
- PyTorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024) · ★208 · Updated last year
- [CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation · ★1,210 · Updated 4 months ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset · ★489 · Updated 3 weeks ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers · ★480 · Updated 3 months ago
- Official ComfyUI repository of HelloMeme · ★372 · Updated 4 months ago