Vchitect / Vchitect-2.0

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

☆913

Alternatives and similar repositories for Vchitect-2.0

Users who are interested in Vchitect-2.0 are comparing it to the repositories listed below.

360CVGroup / FancyVideo
Video generation from text & image, 1st-gen
☆919 Updated 9 months ago
alibaba / Tora
[CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation
☆1,228 Updated 7 months ago
thu-ml / DiT-Extrapolation
Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) and UltraViCo (IC…
☆785 Updated last week
SkyworkAI / UniPic
Open-source SOTA multi-image editing model
☆850 Updated 2 weeks ago
Alpha-VLLM / Lumina-Video
☆414 Updated 11 months ago
NJU-PCALab / RAG-Diffusion
[ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
☆620 Updated 2 months ago
ShareGPT4Omni / ShareGPT4Video
[NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"
☆1,085 Updated last year
megvii-research / megactor
☆899 Updated last year
showlab / Show-1
[IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
☆1,134 Updated 4 months ago
Tencent-Hunyuan / MixGRPO
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE
☆1,092 Updated last week
rhymes-ai / Allegro
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…
☆1,116 Updated last year
XueZeyue / DanceGRPO
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
☆1,499 Updated 3 months ago
PKU-YuanGroup / ConsisID
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
☆812 Updated 5 months ago
Alpha-VLLM / Lumina-mGPT-2.0
Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
☆1,077 Updated 3 months ago
ali-vilab / UniAnimate
Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
☆1,180 Updated 9 months ago
damo-cv / RealisDance
The official implementation of RealisDance
☆610 Updated 7 months ago
Doby-Xu / WithAnyone
✨ WithAnyone is capable of generating high-quality, controllable, and ID consistent images
☆550 Updated last month
yejy53 / Echo-4o
Echo-4o: Harnessing Proprietary Models’ Synthetic Images for Improved Image Generation
☆505 Updated 2 months ago
Alpha-VLLM / Lumina-DiMOO
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
☆937 Updated last month
showlab / MotionDirector
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
☆1,038 Updated last year
zibojia / COCOCO
Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, C…
☆320 Updated last year
AlaaLab / InstructCV
[ICLR 2024] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"
☆461 Updated last year
sy77777en / CameraBench
[NeurIPS 2025 Spotlight] Towards Understanding Camera Motions in Any Video
☆269 Updated 2 months ago
360CVGroup / Qihoo-T2X
Efficient DiT architecture for text2any tasks, ICLR2025
☆447 Updated 9 months ago
JavisVerse / JavisDiT
Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization"
☆316 Updated last month
fudan-generative-vision / hallo3
[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
☆1,367 Updated 10 months ago
fudan-generative-vision / OpenHumanVid
[CVPR 2025] A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
☆322 Updated 11 months ago
Ephemeral182 / PosterCraft
[ICLR'26] Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework
☆527 Updated 2 weeks ago
FoundationVision / Liquid
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
☆637 Updated 3 months ago
Ola-Omni / Ola
Ola: Pushing the Frontiers of Omni-Modal Language Model
☆386 Updated 7 months ago