[ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
☆149Oct 25, 2024Updated last year
Alternatives and similar repositories for PhyGenBench
Users that are interested in PhyGenBench are comparing it to the libraries listed below
Sorting:
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆180Jan 30, 2026Updated last month
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆61Jul 31, 2025Updated 7 months ago
- Benchmarking physical understanding in generative video models☆244Feb 2, 2026Updated 3 weeks ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆18May 2, 2025Updated 9 months ago
- ICML 2025 - Impossible Videos☆83Jul 23, 2025Updated 7 months ago
- ☆163Jan 6, 2025Updated last year
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆263Feb 8, 2026Updated 2 weeks ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆53May 8, 2025Updated 9 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆113Dec 4, 2025Updated 2 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- ☆26Jul 29, 2025Updated 7 months ago
- ☆11Oct 2, 2024Updated last year
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 5 months ago
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision☆27May 26, 2025Updated 9 months ago
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,485Updated this week
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…☆268Dec 23, 2025Updated 2 months ago
- [ICCV 2025] Prompt-A-Video☆22Feb 2, 2025Updated last year
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆85May 4, 2025Updated 9 months ago
- PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024)☆335Oct 24, 2024Updated last year
- [ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆38Sep 26, 2025Updated 5 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆428Sep 24, 2025Updated 5 months ago
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆260Sep 22, 2025Updated 5 months ago
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆94Sep 14, 2024Updated last year
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆132Sep 7, 2024Updated last year
- VideoGen-Eval: Agent-based System for Video Generation Evaluation☆255Dec 16, 2025Updated 2 months ago
- Official repository of IDEA-Bench☆39Jan 24, 2025Updated last year
- Official code for the paper: Can3Tok (ICCV2025)☆36Aug 29, 2025Updated 5 months ago
- A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems☆414Sep 22, 2025Updated 5 months ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Apr 27, 2025Updated 10 months ago
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Updated this week
- ☆52Dec 13, 2024Updated last year
- ☆17Jul 30, 2024Updated last year
- [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".☆203Jun 18, 2025Updated 8 months ago
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆309Sep 28, 2025Updated 5 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆311Jan 31, 2025Updated last year
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆55Aug 16, 2025Updated 6 months ago
- Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion☆12Jan 14, 2026Updated last month
- ☆13Feb 2, 2025Updated last year