inFaaa / Awesome-Personalized-Video-Creation

📖 This is a repository for organizing papers, code, and other resources related to personalized video generation and editing.

☆57

Alternatives and similar repositories for Awesome-Personalized-Video-Creation

Users who are interested in Awesome-Personalized-Video-Creation are comparing it to the repositories listed below.

Fr0zenCrane / UniCoT
Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
☆167 Updated last week
shxie2020 / Awesome-UGVFM
A collection of vision foundation models unifying understanding and generation.
☆58 Updated 10 months ago
PKU-YuanGroup / UAE
Official repository for the UAE paper, unified-GRPO, and unified-Bench
☆147 Updated 2 months ago
thuml / MiniVeo3-Reasoner
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…
☆181 Updated last month
gogoduan / GoT-R1
GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning
☆100 Updated 5 months ago
HorizonWind2004 / reconstruction-alignment
Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unifie…
☆310 Updated last month
wusize / OpenUni
☆162 Updated 4 months ago
TencentARC / MindOmni
☆132 Updated last month
aim-uofa / dLLM-MidTruth
☆56 Updated 3 months ago
multimodal-reasoning-lab / Bagel-Zebra-CoT
https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT
☆101 Updated 2 weeks ago
ModelTC / HarmoniCa
[ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…
☆43 Updated 4 months ago
rongyaofang / GoT
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
☆295 Updated last month
weijiawu / Awesome-Visual-Reinforcement-Learning
📖 This is a repository for organizing papers, code, and other resources related to Visual Reinforcement Learning.
☆332 Updated last week
PKU-YuanGroup / WISE
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
☆161 Updated 2 weeks ago
Tiezheng11 / Vision-Language-Vision
☆62 Updated 4 months ago
TencentARC / TokLIP
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
☆231 Updated 3 months ago
Franklin-Zhang0 / ReasonGen-R1
Official repository for ReasonGen-R1
☆73 Updated 4 months ago
Tencent / HaploVLM
ICML 2025
☆60 Updated 2 months ago
wusize / Harmon
[ICCV 2025] Code release of "Harmonizing Visual Representations for Unified Multimodal Understanding and Generation"
☆177 Updated 6 months ago
facebookresearch / metamorph
Code for "MetaMorph: Multimodal Understanding and Generation via Instruction Tuning"
☆221 Updated 7 months ago
UMass-Embodied-AGI / Mirage
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)
☆191 Updated 3 months ago
InternLM / Spatial-SSRL
Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"
☆71 Updated this week
TencentARC / ARC-Hunyuan-Video-7B
Structured Video Comprehension of Real-World Shorts
☆216 Updated 2 months ago
PhoenixZ810 / RISEBench
[NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing
☆114 Updated last month
egolife-ai / Ego-R1
Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning
☆127 Updated 3 months ago
SxJyJay / UniToken
[CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…
☆97 Updated 6 months ago
mll-lab-nu / TStar
TStar is a unified temporal search framework for long-form video question answering
☆71 Updated 2 months ago
aniki-ly / FreeLong
[NeurIPS 2024] The official implementation of the research paper "FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Atten…
☆60 Updated 4 months ago
showlab / Impossible-Videos
ICML 2025 - Impossible Videos
☆78 Updated 3 months ago
aHapBean / VideoREPA
[NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models
☆115 Updated 2 weeks ago