[ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
☆158Oct 25, 2024Updated last year
Alternatives and similar repositories for PhyGenBench
Users that are interested in PhyGenBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆192Jan 30, 2026Updated 2 months ago
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆64Jul 31, 2025Updated 9 months ago
- Benchmarking physical understanding in generative video models☆285Apr 16, 2026Updated last week
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆56May 8, 2025Updated 11 months ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆19May 2, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆134Sep 7, 2024Updated last year
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆291Apr 2, 2026Updated 3 weeks ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆273Sep 22, 2025Updated 7 months ago
- Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion☆12Jan 14, 2026Updated 3 months ago
- ☆174Jan 6, 2025Updated last year
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,605Mar 23, 2026Updated last month
- PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024)☆346Oct 24, 2024Updated last year
- ICML 2025 - Impossible Videos☆83Jul 23, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆116Dec 4, 2025Updated 4 months ago
- Official code for the paper: Can3Tok (ICCV2025)☆43Aug 29, 2025Updated 8 months ago
- [ICCV 2025] Prompt-A-Video☆23Feb 2, 2025Updated last year
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆86May 4, 2025Updated 11 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆454Sep 24, 2025Updated 7 months ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Mar 31, 2026Updated 3 weeks ago
- ☆17Jul 30, 2024Updated last year
- A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems☆430Sep 22, 2025Updated 7 months ago
- [NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector☆37Mar 7, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- BESA is a differentiable weight pruning technique for large language models.☆17Mar 4, 2024Updated 2 years ago
- [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".☆203Jun 18, 2025Updated 10 months ago
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation☆574Sep 16, 2024Updated last year
- VideoGen-Eval: Agent-based System for Video Generation Evaluation☆262Dec 16, 2025Updated 4 months ago
- Official repository of IDEA-Bench☆40Jan 24, 2025Updated last year
- [ICCV 2025] Pytorch implementation of "VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Pr…☆54Jul 28, 2025Updated 9 months ago
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision☆28May 26, 2025Updated 11 months ago
- MagicVFX: Visual Effects Synthesis in Just Minutes☆17Dec 16, 2024Updated last year
- [ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆39Sep 26, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning☆773Sep 7, 2025Updated 7 months ago
- TriMe++: a multi-threaded software library for 2D geometry meshing using the Delaunay triangulation☆19Feb 20, 2026Updated 2 months ago
- Open-source code for GEAR☆13Dec 3, 2025Updated 4 months ago
- ☆39Jul 29, 2025Updated 9 months ago
- The official code of "Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling"☆51Feb 26, 2026Updated 2 months ago
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation☆860Mar 19, 2026Updated last month
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆50Aug 5, 2025Updated 8 months ago