DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning
☆166Nov 16, 2025Updated 5 months ago
Alternatives and similar repositories for DeepPHY
Users that are interested in DeepPHY are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- IPHYRE: Interactive Physical Reasoning, ICLR 2024☆19Oct 18, 2024Updated last year
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…☆164Sep 3, 2024Updated last year
- CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics☆28Nov 1, 2025Updated 5 months ago
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆13Jun 7, 2025Updated 10 months ago
- [NAACL 2025 Main] Official implementation of "From Allies to Adversaries: Manipulating LLM Tool Scheduling through Adversarial Injection"…☆21Jun 11, 2025Updated 10 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Code for the robot-assisted feeding project at EmPRISE Lab☆28Apr 8, 2026Updated last week
- Public Evaluation Result Archieve for BFCL☆29Dec 17, 2025Updated 3 months ago
- ☆33Apr 9, 2025Updated last year
- Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization