[ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding>
☆85Jan 21, 2026Updated last month
Alternatives and similar repositories for PhysBench
Users that are interested in PhysBench are comparing it to the libraries listed below
Sorting:
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆46Jul 17, 2025Updated 7 months ago
- ☆21Nov 5, 2024Updated last year
- ☆23Feb 14, 2025Updated last year
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT☆124Jan 30, 2026Updated last month
- [CVPR 2025] Official code for the paper "SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis"☆135Mar 18, 2025Updated 11 months ago
- MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆22Sep 23, 2025Updated 5 months ago
- init☆10May 25, 2025Updated 9 months ago
- ☆11Sep 5, 2025Updated 5 months ago
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆21Feb 11, 2026Updated 2 weeks ago
- The Vulkan GPU radix sort implementation from Google Fuchsia, but with CMake☆12Jan 13, 2023Updated 3 years ago
- ☆55Aug 5, 2025Updated 6 months ago
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆38Jul 5, 2025Updated 7 months ago
- ☆43May 6, 2024Updated last year
- ☆16Jan 5, 2025Updated last year
- 🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, …☆35Jan 27, 2026Updated last month
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability☆37Mar 18, 2025Updated 11 months ago
- Efficient Scaling laws and collaborative pretraining.☆21Sep 18, 2025Updated 5 months ago
- Vision-Language-Action Optimization with Trajectory Ensemble Voting☆25Feb 18, 2026Updated last week
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"☆32Nov 30, 2025Updated 3 months ago
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆30Oct 28, 2025Updated 4 months ago
- Train and visualise a latent variable model of moving objects.☆16Apr 28, 2020Updated 5 years ago
- ☆32Jan 9, 2025Updated last year
- An awesome 3DGS models library☆19Apr 23, 2024Updated last year
- Bird’s-eye view map from monocular cameras using BEVFormer + HOP methods.☆15Jan 17, 2024Updated 2 years ago
- GLOMAP - Global Structured-from-Motion Revisited☆16Dec 5, 2024Updated last year
- Code for experiments in the paper: "Compositional Reinforcement Learning from Logical Specifications" (https://arxiv.org/abs/2106.13906).☆16Oct 26, 2021Updated 4 years ago
- Code Release for Strap Paper☆23Jan 29, 2026Updated last month
- OneTo3D: One Image to Editable Dynamic 3D Model and Video Generation☆15May 15, 2024Updated last year
- [CVPR 2025] Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion☆43Mar 21, 2025Updated 11 months ago
- [ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible☆121Aug 10, 2025Updated 6 months ago
- Benchmarking physical understanding in generative video models☆247Feb 2, 2026Updated last month
- ☆103Jul 24, 2024Updated last year
- [ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban…☆26Jul 15, 2025Updated 7 months ago
- ☆33Jul 9, 2025Updated 7 months ago
- Incooperating depth information into NeRF☆13Jan 12, 2023Updated 3 years ago
- ☆19Sep 19, 2024Updated last year
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆28Feb 25, 2025Updated last year
- LLGS: Illuminating Gaussian Splatting via absorptance Modulation☆20Oct 16, 2024Updated last year