cambridgeltl / topviewrsLinks
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)
☆15Updated 6 months ago
Alternatives and similar repositories for topviewrs
Users that are interested in topviewrs are comparing it to the libraries listed below
Sorting:
- ☆20Updated 10 months ago
- Evaluate Multimodal LLMs as Embodied Agents☆56Updated 10 months ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆25Updated 2 months ago
- ☆33Updated last year
- [NeurIPS'25] SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning☆38Updated 2 months ago
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆105Updated last year
- ☆31Updated last year
- ☆51Updated 8 months ago
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)☆85Updated 7 months ago
- ☆133Updated last year
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects"☆19Updated 8 months ago
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"☆25Updated last month
- Official Repo of LangSuitE☆84Updated last year
- ☆56Updated last year
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated 4 months ago
- ☆21Updated 8 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated last year
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Updated last year
- Scaffold Prompting to promote LMMs☆44Updated last year
- Sotopia-RL: Reward Design for Social Intelligence☆46Updated 4 months ago
- Bayes-Adaptive RL for LLM Reasoning☆43Updated 7 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Updated last year
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆23Updated 2 months ago
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆35Updated last year
- This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]☆77Updated 6 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆50Updated 3 weeks ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆81Updated last year
- Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.☆83Updated this week
- ☆16Updated 3 months ago
- [ICLR 2023] CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding☆46Updated 7 months ago