Official code repo of the CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation
☆62 · Jul 31, 2025 · Updated 7 months ago
Alternatives and similar repositories for PhyT2V
Users who are interested in PhyT2V are comparing it to the repositories listed below
- [ICML 2025] The code and data of the paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆150 · Oct 25, 2024 · Updated last year
- Code of the paper "FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi… ☆28 · Aug 26, 2025 · Updated 6 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025) ☆53 · May 8, 2025 · Updated 9 months ago
- World Simulator Assistant for Physics-Aware Text-to-Video Generation ☆267 · Sep 22, 2025 · Updated 5 months ago
- PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024) ☆338 · Oct 24, 2024 · Updated last year
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites. ☆267 · Feb 8, 2026 · Updated 3 weeks ago
- Public implementation of Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling ☆30 · Dec 3, 2025 · Updated 3 months ago
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation ☆53 · Jan 5, 2026 · Updated 2 months ago
- Official implementation for the paper "Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI" ☆40 · Oct 26, 2025 · Updated 4 months ago
- Independent Multi-Modal Segmentation ☆12 · Jun 12, 2025 · Updated 8 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models ☆164 · Jan 7, 2026 · Updated 2 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment. ☆85 · May 4, 2025 · Updated 10 months ago
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and… ☆269 · Dec 23, 2025 · Updated 2 months ago
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation ☆106 · Oct 25, 2025 · Updated 4 months ago
- Finetuning and inference tools for the CogView4 and CogVideoX model series. ☆118 · May 14, 2025 · Updated 9 months ago
- [ICCV 2025] PyTorch implementation of "VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Pr… ☆49 · Jul 28, 2025 · Updated 7 months ago
- Official repo for the paper Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation at CVPR 2025 (Hig… ☆20 · May 12, 2025 · Updated 9 months ago
- ☆92 · May 25, 2024 · Updated last year
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long… ☆322 · Mar 30, 2025 · Updated 11 months ago
- A comprehensive investigation of advanced physics-aware AIGC works ☆28 · Dec 13, 2025 · Updated 2 months ago
- ☆15 · Jan 8, 2024 · Updated 2 years ago
- Benchmarking physical understanding in generative video models ☆257 · Feb 2, 2026 · Updated last month
- Exploring Representation-Aligned Latent Space for Better Generation ☆17 · Feb 4, 2025 · Updated last year
- ICML 2025 - Impossible Videos ☆83 · Jul 23, 2025 · Updated 7 months ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way ☆47 · Oct 10, 2025 · Updated 4 months ago
- Official repo for the paper Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control ☆41 · Dec 30, 2024 · Updated last year
- DTact: A Vision-Based Tactile Sensor that Measures High-Resolution 3D Geometry Directly from Darkness (ICRA'23) ☆20 · Aug 29, 2023 · Updated 2 years ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization ☆25 · Apr 14, 2025 · Updated 10 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video ☆48 · Sep 8, 2025 · Updated 5 months ago
- [CVPR 2024] VidToMe: Video Token Merging for Zero-Shot Video Editing ☆20 · Feb 29, 2024 · Updated 2 years ago
- [WIP] Code for LangToMo ☆20 · Jun 25, 2025 · Updated 8 months ago
- [CVPR 2025] The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation ☆111 · Oct 27, 2025 · Updated 4 months ago
- [RSS 2025] CLIP-RT: Learning Language-Conditioned Robotic Policies from Natural Language Supervision ☆33 · May 13, 2025 · Updated 9 months ago
- ☆26 · Jun 22, 2024 · Updated last year
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms ☆46 · Nov 12, 2025 · Updated 3 months ago
- A framework for Longitudinal Radiology Report Generation ☆26 · Aug 10, 2024 · Updated last year
- [NeurIPS 2024] The official implementation of the research paper "FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Atten… ☆64 · Jul 2, 2025 · Updated 8 months ago
- Official repo for the paper "CamI2V: Camera-Controlled Image-to-Video Diffusion Model" ☆165 · Sep 29, 2025 · Updated 5 months ago
- ☆165 · Jan 6, 2025 · Updated last year