Tianhao-Qi/Mask2DiT

Tianhao-Qi / Mask2DiT

CVPR 2025 Accepted Papers

☆26

Alternatives and similar repositories for Mask2DiT

Users interested in Mask2DiT are comparing it to the repositories listed below.

Phantom-video / LibraGen
☆17 · Mar 19, 2026 · Updated 4 months ago
jiawn-creator / Dynamic-DiT
☆18 · Mar 21, 2025 · Updated last year
Vchitect / CineTrans
CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models
☆32 · Feb 3, 2026 · Updated 5 months ago
Jayce1kk / SpaceVLLM
SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability
☆17 · May 8, 2025 · Updated last year
jacklishufan / Reflect-DiT
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
☆56 · Aug 16, 2025 · Updated 11 months ago
Hritikbansal / talc
☆30 · May 9, 2024 · Updated 2 years ago
Phantom-video / Phantom-Data
Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset
☆115 · Feb 25, 2026 · Updated 4 months ago
bytedance / RealCustom
☆97 · Nov 6, 2025 · Updated 8 months ago
ymju-BAAI / CI-VID
☆29 · Sep 4, 2025 · Updated 10 months ago
BerserkerVV / Video2LoRA
Video2LoRA: Unified Semantic-Controlled Video Generation via Per-Reference-Video LoRA (CVPR 2026 Findings)
☆22 · May 25, 2026 · Updated last month
FrameX-AI / Stream-R1
☆54 · May 6, 2026 · Updated 2 months ago
Corleone-Huang / DynamicVectorQuantization
☆21 · Jun 3, 2023 · Updated 3 years ago
Vchitect / Cut2Next
Cut2Next: Generating Next Shot via In-Context Tuning
☆33 · Aug 21, 2025 · Updated 11 months ago
ruffiann / MagicVFX
MagicVFX: Visual Effects Synthesis in Just Minutes
☆18 · Dec 16, 2024 · Updated last year
feizc / Video-In-Context
Video Diffusion Transformers are In-Context Learners
☆37 · Jan 6, 2025 · Updated last year
yhlleo / EfficientMoE
Official implementation of "Efficient Training of Diffusion MoE models: A Practical Recipe"
☆17 · Dec 21, 2025 · Updated 7 months ago
Corleone-Huang / RealCustomProject
☆19 · Apr 16, 2025 · Updated last year
deepshwang / crepa
☆15 · Jun 21, 2025 · Updated last year
Vicky0522 / TokensGen
[ICCV 2025] TokensGen: Harnessing Condensed Tokens for Long Video Generation
☆57 · Dec 10, 2025 · Updated 7 months ago
harshbhatt7585 / StillMoving
☆17 · Jul 30, 2024 · Updated last year
AMD-AGI / ReNeg
ReNeg: Learning Negative Embedding with Reward Guidance
☆18 · Jun 4, 2026 · Updated last month
byhuang123 / PoCo
[CVPR 2026] Official implementation of our paper “Rethinking Position Embedding as a Context Controller for Multi-Reference and Multi-Shot…
☆19 · Apr 8, 2026 · Updated 3 months ago
FrameX-AI / Stream-T1
☆37 · Jun 23, 2026 · Updated 3 weeks ago
Phantom-video / OmniInsert
OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models
☆162 · Mar 4, 2026 · Updated 4 months ago
KlingAIResearch / MultiShotMaster
CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework"
☆168 · Feb 22, 2026 · Updated 4 months ago
KlingAIResearch / ShotStream
[ECCV 2026] ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
☆171 · Jun 23, 2026 · Updated 3 weeks ago
Lyne1 / Realgeneral
RealGeneral (ICCV 2025)
☆17 · Jul 16, 2025 · Updated last year
robingg1 / PoseTraj
[CVPR 2025] PoseTraj: Pose-Aware Trajectory Control in Video Diffusion
☆23 · May 26, 2026 · Updated last month
snap-research / MSRVTT-Personalization
Benchmark dataset and code of MSRVTT-Personalization
☆52 · Nov 10, 2025 · Updated 8 months ago
JosephTiTan / FreePCA
Code of the paper "FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…
☆27 · Apr 3, 2026 · Updated 3 months ago
flying-sky999 / OmniV2V
☆15 · Jun 2, 2025 · Updated last year
KlingAIResearch / DecMem
DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory
☆26 · Jun 1, 2026 · Updated last month
UCSC-VLAA / Complex-Edit
Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
☆29 · Apr 22, 2025 · Updated last year
apple / ml-unigen
UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation
☆43 · Nov 24, 2025 · Updated 7 months ago
justincui03 / Self-Forcing-Plus-Plus
Official Repo for Self-Forcing++ High Quality Long Video Generation
☆264 · Oct 13, 2025 · Updated 9 months ago
KlingAIResearch / MemFlow
Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"
☆214 · Dec 29, 2025 · Updated 6 months ago
Vchitect / RAPO
[CVPR 2025] The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
☆105 · Oct 27, 2025 · Updated 8 months ago
vTAD2025-Challenge / vTAD
☆15 · Oct 24, 2025 · Updated 8 months ago
Tencent / HaploVLM
ICML 2025
☆63 · Aug 28, 2025 · Updated 10 months ago