Official release of the benchmark from the paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for MLLMs"
☆16 · Aug 1, 2025 · Updated 7 months ago
Alternatives and similar repositories for Visual-Spatial-Planning
Users who are interested in Visual-Spatial-Planning are comparing it to the repositories listed below
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24… ☆26 · Jun 14, 2024 · Updated last year
- ☆10 · Oct 17, 2023 · Updated 2 years ago
- Implementation of paper 'Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing' ☆22 · Jun 9, 2024 · Updated last year
- ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models ☆34 · Nov 18, 2025 · Updated 3 months ago
- ☆22 · Jun 11, 2024 · Updated last year
- [AAAI'26, Oral] Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learni… ☆43 · Jul 16, 2025 · Updated 7 months ago
- ☆39 · Updated this week
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection" ☆10 · May 5, 2020 · Updated 5 years ago
- Oculus-Rift - Gazebo Navigator (PS3 controller & keyboard op) ☆11 · Oct 9, 2014 · Updated 11 years ago
- Learn a domain-specific heuristic function in a domain-independent fashion to solve pathfinding problems. ☆10 · Updated this week
- Emotion Recognition ☆10 · Oct 22, 2017 · Updated 8 years ago
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models" ☆46 · Sep 21, 2023 · Updated 2 years ago
- mouse pet-ct image segmentation ☆12 · Feb 19, 2023 · Updated 3 years ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection ☆25 · May 31, 2025 · Updated 9 months ago
- Surrogate Modeling of the Aerodynamic Performance for Transonic Regime ☆13 · Feb 12, 2024 · Updated 2 years ago
- ☆12 · Jul 25, 2023 · Updated 2 years ago
- ☆12 · Aug 31, 2023 · Updated 2 years ago
- ☆13 · Mar 21, 2023 · Updated 2 years ago
- ☆10 · Oct 7, 2024 · Updated last year
- BAD: BiAs Detection for Large Language Models in the context of candidate screening (EECS 692) ☆12 · Feb 14, 2024 · Updated 2 years ago
- Scripts for KGIRNet model for ESWC ☆10 · Jul 6, 2023 · Updated 2 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning" ☆11 · Jan 10, 2025 · Updated last year
- Pytorch implementation of a Variational Autoencoder trained on CIFAR-10. The encoder and decoder modules are modelled using a resnet-styl… ☆16 · Jan 29, 2024 · Updated 2 years ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021) ☆13 · Jun 2, 2021 · Updated 4 years ago
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos ☆24 · Aug 8, 2025 · Updated 6 months ago
- Repo for running various baselines with Behavior-1K ☆33 · Nov 7, 2025 · Updated 3 months ago
- A browser based CadQuery server ☆12 · Feb 18, 2025 · Updated last year
- A library for constructing and evaluating state features made up of description logics for planning. ☆12 · Mar 17, 2025 · Updated 11 months ago
- (Obsolete due to Steam's new Game Recording feature) A suite of tools for formatting and editing clips saved with Geforce Experience's "I… ☆14 · Jun 27, 2024 · Updated last year
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca… ☆61 · May 9, 2023 · Updated 2 years ago
- This repository contains the code for Face detection using SSD. ☆13 · Jun 10, 2020 · Updated 5 years ago
- More reliable Video Understanding Evaluation ☆14 · Sep 23, 2025 · Updated 5 months ago
- ☆13 · Mar 22, 2021 · Updated 4 years ago
- Learning effect regulated object categories ☆15 · Nov 4, 2025 · Updated 3 months ago
- CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction ☆18 · Oct 20, 2025 · Updated 4 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning. ☆25 · Oct 7, 2025 · Updated 4 months ago
- AllenNLP integration for Shiba: Japanese CANINE model ☆12 · Jun 26, 2021 · Updated 4 years ago
- Python/Gstreamer based project to stream video from embedded system cameras in various ways ☆12 · Sep 20, 2021 · Updated 4 years ago
- A survey on MM-LLMs for long video understanding: From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long… ☆18 · Sep 12, 2025 · Updated 5 months ago