Deaddawn / MovieLLM-code

☆176

Alternatives and similar repositories for MovieLLM-code

Users that are interested in MovieLLM-code are comparing it to the libraries listed below

Sorting:

md-mohaiminul / VideoRecap
☆186Updated 10 months ago
aim-uofa / AutoStory
[IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
☆150Updated 5 months ago
Jeff-LiangF / FlowVid
☆143Updated 10 months ago
AILab-CVC / SEED-X
Multimodal Models in Real World
☆503Updated 2 months ago
ID-Animator / ID-Animator
☆375Updated 11 months ago
gpt4video / GPT4Video
Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation
☆139Updated 6 months ago
bytedance / Shot2Story
A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.
☆131Updated 3 months ago
junjiehe96 / UniPortrait
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization
☆248Updated last week
JianhongBai / UniEdit
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
☆107Updated 3 weeks ago
qinghew / CharacterFactory
[TIP 2025] CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models 🔥
☆211Updated 3 weeks ago
aim-uofa / MovieDreamer
[ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences
☆300Updated 9 months ago
farewellthree / PPLLaVA
Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"
☆130Updated 5 months ago
AILab-CVC / FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
☆407Updated 10 months ago
WangWenhao0716 / VidProM
[NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models
☆146Updated 7 months ago
modelscope / lite-sora
An initiative to replicate Sora
☆104Updated last year
Vchitect / Vlogger
[CVPR2024] Make Your Dream A Vlog
☆423Updated last year
Francis-Rings / MotionEditor
[CVPR2024] MotionEditor is the first diffusion-based model capable of video motion editing.
☆167Updated 3 weeks ago
Ji4chenLi / t2v-turbo
Code repository for T2V-Turbo and T2V-Turbo-v2
☆299Updated 3 months ago
HaozheZhao / UltraEdit
☆229Updated 9 months ago
bytedance / vidi
The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"
☆96Updated 3 weeks ago
YangLing0818 / VideoTetris
[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation
☆218Updated 6 months ago
MC-E / ReVideo
NeurIPS 2024
☆382Updated 7 months ago
mulanai / MuLan
MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)
☆136Updated 3 months ago
Francis-Rings / MotionFollower
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion
☆215Updated 3 weeks ago
IVGSZ / Flash-VStream
This is the official implementation of "Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams"
☆181Updated 4 months ago
AILab-CVC / VideoGen-Eval
VideoGen-Eval: Agent-based System for Video Generation Evaluation
☆230Updated last month
MS-Diffusion / MS-Diffusion
[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
☆266Updated last month
knightyxp / VideoGrain
[ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video …
☆124Updated last month
I2V-Adapter / I2V-Adapter-repo
I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models
☆205Updated last year
showlab / VideoSwap
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
☆390Updated 5 months ago