thu-coai/VPO

☆25

Alternatives and similar repositories for VPO

Users who are interested in VPO are comparing it to the repositories listed below.

VidCapBench / VidCapBench
☆13 · May 17, 2025 · Updated last year
Vicky0522 / TokensGen
[ICCV 2025] TokensGen: Harnessing Condensed Tokens for Long Video Generation
☆57 · Dec 10, 2025 · Updated 7 months ago
JianhuiWei7 / UniVBench
[CVPR 2026] The official code and datasets for "UniVBench: Towards Unified Evaluation for Video Foundation Models"
☆23 · May 27, 2026 · Updated last month
JPShi12 / VideoLoom
[ICML 2026] VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding
☆27 · Jul 3, 2026 · Updated 3 weeks ago
LanDiff / LanDiff
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation
☆41 · May 4, 2025 · Updated last year
thu-coai / BARREL
[ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs
☆18 · May 21, 2025 · Updated last year
NVlabs / FRAG
☆15 · Apr 25, 2025 · Updated last year
hit-perfect / Awesome-Video-World-Models
A Mechanistic View on Video Generation as World Models: State and Dynamics
☆55 · Jun 27, 2026 · Updated 3 weeks ago
daeunni / Video-Skill-CoT
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning" [EMNLP 2025 Findings]
☆18 · Aug 27, 2025 · Updated 11 months ago
why986 / VFA
Official implementation of MM2023 paper "Versatile Face Animator: Driving Arbitrary 3D Facial Avatar in RGBD Space"
☆29 · Oct 9, 2023 · Updated 2 years ago
CIntellifusion / VideoDPO
Official Implementation of VideoDPO
☆169 · Jun 1, 2025 · Updated last year
KlingAIResearch / VANS
[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO
☆119 · Feb 28, 2026 · Updated 4 months ago
SAIS-FUXI / IPO
☆58 · May 6, 2025 · Updated last year
zai-org / VisionReward
[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
☆422 · Mar 26, 2025 · Updated last year
Thrcle421 / DiT-Mem
Learning Plug-and-play Memory for Guiding Video Diffusion Models
☆26 · Dec 1, 2025 · Updated 7 months ago
deepshwang / crepa
☆15 · Jun 21, 2025 · Updated last year
openmedlab / Swin-UMamba
☆14 · May 23, 2024 · Updated 2 years ago
Vchitect / Cut2Next
Cut2Next: Generating Next Shot via In-Context Tuning
☆33 · Aug 21, 2025 · Updated 11 months ago
ruili33 / TPO
☆41 · Sep 9, 2025 · Updated 10 months ago
lwang88 / ct_synthesis
☆15 · Jul 24, 2022 · Updated 4 years ago
Vchitect / RAPO
[CVPR 2025] The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
☆105 · Oct 27, 2025 · Updated 8 months ago
thu-coai / SPaR
☆47 · Jun 11, 2025 · Updated last year
zishen-ucap / PromptTea
Official implementation of the paper "PromptTea: Let Prompts Tell TeaCache the Optimal Threshold"
☆35 · Oct 27, 2025 · Updated 8 months ago
hlchen23 / VERIFIED
Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan…
☆40 · Jan 20, 2025 · Updated last year
Carmenw1203 / DanceCamAnimator-Official
DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis. [ACMMM 2024] Official PyTorch implementation
☆41 · Sep 24, 2024 · Updated last year
WillWu111 / ViBe
[ECCV 2026] ViBe: Ultra-High-Resolution Video Synthesis Born from Pure Images
☆31 · May 21, 2026 · Updated 2 months ago
Bujiazi / HiFlow
[NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance
☆88 · Sep 18, 2025 · Updated 10 months ago
CogComp / Salient-Event-Detection
The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"
☆10 · Jul 5, 2022 · Updated 4 years ago
SwiftieH / SpGAT
Spectral Graph Attention Network with Fast Eigen-approximation
☆11 · Dec 24, 2021 · Updated 4 years ago
zai-org / SSVAE
Official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".
☆72 · Dec 25, 2025 · Updated 7 months ago
caiyuanhao1998 / Open-PhyGDPO
PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation (ECCV 2026)
☆69 · Jun 20, 2026 · Updated last month
xzc-zju / AdaVideoRAG
[NeurIPS 2025] AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding
☆15 · Jun 16, 2025 · Updated last year
ZackZikaiXiao / Awesome-Agent-Environments
Awesome Agent Environments
☆17 · Apr 10, 2026 · Updated 3 months ago
Fantasy-AMAP / fantasy-talking2
[AAAI 2026] FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation
☆65 · Aug 20, 2025 · Updated 11 months ago
thu-coai / AutoDetect
Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs.
☆47 · Jun 25, 2024 · Updated 2 years ago
ckinpdx / ComfyUI-SCAIL-AudioReactive
Generate audio-reactive SCAIL pose sequences for character animation without requiring input video tracking.
☆17 · Jan 2, 2026 · Updated 6 months ago
Harvard-AI-and-Robotics-Lab / FiVE-Bench
[ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
☆19 · Aug 26, 2025 · Updated 11 months ago
ryeii / CLUE
Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation.
☆13 · Feb 25, 2025 · Updated last year
WeijiaZhang24 / DCSurvival
☆11 · Apr 5, 2024 · Updated 2 years ago