Hritikbansal / videophyView external linksLinks
Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics
☆180Jan 30, 2026Updated 2 weeks ago
Alternatives and similar repositories for videophy
Users that are interested in videophy are comparing it to the libraries listed below
Sorting:
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆149Oct 25, 2024Updated last year
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆60Jul 31, 2025Updated 6 months ago
- Benchmarking physical understanding in generative video models☆240Feb 2, 2026Updated last week
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…☆267Dec 23, 2025Updated last month
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆53May 8, 2025Updated 9 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆306Mar 12, 2025Updated 11 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆310Jan 31, 2025Updated last year
- ☆163Jan 6, 2025Updated last year
- Official Implementation of VideoDPO☆160Jun 1, 2025Updated 8 months ago
- CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control☆172Dec 2, 2024Updated last year
- PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024)☆335Oct 24, 2024Updated last year
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment☆36Oct 5, 2025Updated 4 months ago
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆259Sep 22, 2025Updated 4 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆111Dec 4, 2025Updated 2 months ago
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆394May 30, 2025Updated 8 months ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆24Oct 6, 2024Updated last year
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,475Updated this week
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆24Apr 14, 2025Updated 10 months ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Apr 27, 2025Updated 9 months ago
- ☆32Jul 29, 2025Updated 6 months ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆176Sep 26, 2024Updated last year
- ☆636May 24, 2024Updated last year
- Video Diffusion State Space Models☆19Mar 27, 2024Updated last year
- Official implementation of MTM☆21Aug 30, 2023Updated 2 years ago
- Physics-based Zero-Shot Video Generation☆31Oct 4, 2024Updated last year
- ☆26Jun 22, 2024Updated last year
- Official Implementation of VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Jo…☆23Jun 27, 2025Updated 7 months ago
- code release for HouseCrafter (ICCV 2025 Highlight)☆64Oct 23, 2025Updated 3 months ago
- Code for PhysDreamer☆610Feb 10, 2025Updated last year
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆259Feb 4, 2026Updated last week
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆613Jul 1, 2025Updated 7 months ago
- [CVPR 2024] SceneWiz3D: Towards Text-guided 3D Scene Composition☆96May 4, 2024Updated last year
- [NeurIPS 2024] Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models☆334Jan 21, 2025Updated last year
- [3DV '25] Official repository of the paper "Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes".☆29Dec 2, 2024Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control☆804Jun 9, 2025Updated 8 months ago
- PyTorch implementation of the paper: CASAGPT: Cuboid Arrangement and Scene Assembly for Interior Design [CVPR 2025]☆14Apr 5, 2025Updated 10 months ago
- [ICLR26] GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆103Jan 27, 2026Updated 2 weeks ago
- [AAAI 2025] DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors☆225Jun 7, 2024Updated last year