zhuangshaobin/Video-GPT

[ICLR2026] Video-GPT via Next Clip Diffusion.

☆46

Alternatives and similar repositories for Video-GPT

Users who are interested in Video-GPT are comparing it to the repositories listed below.

zhuangshaobin / WeTok
[ICLR2026] WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction
☆70 · Sep 3, 2025 · Updated 10 months ago
google-deepmind / visual-memory
Code & data for "Towards flexible perception with visual memory" (ICML 2025)
☆19 · Sep 24, 2024 · Updated last year
xiefan-guo / i4vgen
[arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation
☆24 · Oct 6, 2024 · Updated last year
DINGYANB / MUSES
(AAAI 2025) MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration
☆37 · May 21, 2025 · Updated last year
TtuHamg / TextToucher
Official Pytorch Implementation for "TextToucher: Fine-Grained Text-to-Touch Generation" (AAAI 2025)
☆19 · Jan 28, 2026 · Updated 5 months ago
Vicky0522 / TokensGen
[ICCV 2025] TokensGen: Harnessing Condensed Tokens for Long Video Generation
☆57 · Dec 10, 2025 · Updated 7 months ago
desaixie / pa_vdm
CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151
☆89 · May 12, 2025 · Updated last year
markywg / transagent
[NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
☆25 · Oct 17, 2024 · Updated last year
TtuHamg / DriveDiTFit
Official Pytorch Implementation for "DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving Data Generation" (TOMM)
☆24 · Mar 7, 2025 · Updated last year
GigaAI-research / GigaVideo-1
☆17 · Jun 13, 2025 · Updated last year
walker1126 / Latent_Action_Composition
[ICCV 2023] Latent Action Composition for Skeleton-based Action Segmentation
☆22 · Oct 25, 2023 · Updated 2 years ago
YouDream3D / YouDream
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
☆40 · Feb 9, 2025 · Updated last year
Tencent / HaploVLM
ICML2025
☆63 · Aug 28, 2025 · Updated 10 months ago
threedle / hyperfields
☆23 · Dec 11, 2024 · Updated last year
hi-zhengcheng / vividzoo
☆39 · Oct 19, 2024 · Updated last year
daeunni / VideoRepair
Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"
☆52 · Apr 7, 2026 · Updated 3 months ago
SAIS-FUXI / Omni-Video
☆157 · Feb 28, 2026 · Updated 4 months ago
stepfun-ai / NextStep-1
[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autoregressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …
☆691 · Feb 27, 2026 · Updated 4 months ago
pengbo807 / ConditionVideo
Training-Free Condition-Guided Text-to-Video Generation
☆62 · Oct 23, 2025 · Updated 9 months ago
YixunLiang / UniTEX-FLUX
Flux training codes (lora) for UniTEX
☆25 · Jun 8, 2025 · Updated last year
showlab / FAR
Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"
☆312 · Apr 23, 2025 · Updated last year
snap-research / ac3d
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
☆164 · Sep 16, 2025 · Updated 10 months ago
showlab / TPDiff
TPDiff: Temporal Pyramid Video Diffusion Model
☆25 · Mar 13, 2025 · Updated last year
itaychachy / RewardSDS
Official PyTorch Implementation for the "RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling" paper!
☆13 · Jun 10, 2025 · Updated last year
wyhsirius / g3an-project
[CVPR 2020] G3AN: Disentangling Appearance and Motion for Video Generation
☆37 · Feb 5, 2021 · Updated 5 years ago
google-deepmind / physics-IQ-benchmark
Benchmarking physical understanding in generative video models
☆323 · Jun 22, 2026 · Updated last month
KlingAIResearch / Alchemist
☆38 · Dec 19, 2025 · Updated 7 months ago
lzt02 / NiRNE
☆26 · Aug 12, 2025 · Updated 11 months ago
yukangcao / AvatarGO
[ICLR' 25] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
☆69 · Mar 19, 2025 · Updated last year
KlingAIResearch / VideoCanvas
Official Code of "VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning"
☆68 · Oct 10, 2025 · Updated 9 months ago
JosephTiTan / FreePCA
Code of the paper "FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…
☆27 · Apr 3, 2026 · Updated 3 months ago
KlingAIResearch / MultiShotMaster
CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework"
☆171 · Feb 22, 2026 · Updated 5 months ago
Yuxuan-W / Nautilus
[ICCV 2025] Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation
☆59 · Jun 25, 2025 · Updated last year
harisreedhar / Portrait-Talker
Talking head animation
☆26 · Dec 8, 2023 · Updated 2 years ago
Ji4chenLi / t2v-turbo
Code repository for T2V-Turbo and T2V-Turbo-v2
☆312 · Jan 31, 2025 · Updated last year
Costwen / Ouroboros3D
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion (CVPR2025)
☆150 · Oct 22, 2025 · Updated 9 months ago
AILab-CVC / FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
☆429 · Aug 25, 2025 · Updated 11 months ago
wendell0218 / Janus-Pro-R1
[NeurIPS 2025] Official repository of the paper "Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Compreh…
☆23 · Sep 27, 2025 · Updated 9 months ago
Junchao-cs / LIVE
[ICML 2026] "LIVE: Long-horizon Interactive Video World ModEling"
☆35 · Jul 15, 2026 · Updated last week