bytedance / XVerseLinks
Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".
β550Updated last week
Alternatives and similar repositories for XVerse
Users that are interested in XVerse are comparing it to the libraries listed below
Sorting:
- Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"β344Updated last week
- [CVPR 2025 Highlightπ₯] Identity-Preserving Text-to-Video Generation by Frequency Decompositionβ738Updated last month
- Rethinking High-Quality Aesthetic Poster Generation in a Unified Frameworkβ479Updated last month
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformerβ750Updated 3 months ago
- β246Updated last week
- [TIP 2025] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generationβ191Updated 3 months ago
- [AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generationβ160Updated last month
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement π₯β591Updated last month
- Calligrapher: Freestyle Text Image Customizationβ268Updated 2 weeks ago
- Official comfyui repository of Hellomemeβ368Updated last month
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generationβ240Updated 2 weeks ago
- Unified Autoregressive Modeling for Visual Understanding and Generationβ179Updated this week
- We achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additiβ¦β290Updated last month
- Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.β225Updated last week
- β408Updated 4 months ago
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."β386Updated last month
- π₯ [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUXβ211Updated last week
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Datasetβ257Updated last month
- The official implementation of RealisDanceβ585Updated last month
- Codes for ID-Specific Video Customized Diffusionβ454Updated last year
- β1,231Updated this week
- β88Updated last month
- Pytorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024)β204Updated last year
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)β702Updated 2 months ago
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generationβ295Updated last year
- Pusa: Thousands Timesteps Video Diffusion Modelβ536Updated this week
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Modelsβ167Updated last week
- [ICCV 2025] LayerAnimate: Layer-specific Control for Animationβ181Updated last month
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"β311Updated 2 months ago
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animationβ280Updated last month