XinrunXu / DeepPHYLinks
DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning
☆144Updated last week
Alternatives and similar repositories for DeepPHY
Users that are interested in DeepPHY are comparing it to the libraries listed below
Sorting:
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆78Updated 2 months ago
- ☆48Updated 3 months ago
- ☆29Updated last year
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934☆79Updated 2 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆35Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated 5 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆30Updated 4 months ago
- ☆52Updated 2 months ago
- Bayes-Adaptive RL for LLM Reasoning☆37Updated 3 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆47Updated 3 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆107Updated last month
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆46Updated 6 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated 2 weeks ago
- Geometric-Mean Policy Optimization☆68Updated 3 weeks ago
- ☆110Updated 4 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆69Updated 3 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.