cambridgeltl / topviewrs
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)
☆14Updated 3 months ago
Alternatives and similar repositories for topviewrs
Users who are interested in topviewrs are comparing it to the repositories listed below.
- ☆19Updated 6 months ago
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆104Updated last year
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆17Updated last month
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"☆24Updated 3 months ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934☆85Updated 3 months ago
- Bayes-Adaptive RL for LLM Reasoning☆39Updated 3 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆41Updated last month
- ☆48Updated 4 months ago
- Evaluate Multimodal LLMs as Embodied Agents☆54Updated 7 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 8 months ago
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)☆79Updated 3 months ago
- DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning☆153Updated 3 weeks ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆35Updated 3 weeks ago
- ☆21Updated 4 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (COLING 2025)☆29Updated 9 months ago
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects"☆17Updated 4 months ago
- Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks☆30Updated last month
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Updated last year
- ☆27Updated last year
- Official Repo of LangSuitE☆84Updated last year
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Updated last year
- Scaffold Prompting to promote LMMs☆43Updated 9 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆77Updated last year
- Code for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]☆38Updated last month
- Main repo for SimWorld simulator.☆63Updated this week
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆24Updated 11 months ago
- ☆53Updated 3 months ago
- Dataset Reset Policy Optimization☆30Updated last year
- ☆52Updated last year
- [NeurIPS 2024] An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆18Updated 11 months ago