UCSB-NLP-Chang/Visual-Spatial-Planning

UCSB-NLP-Chang / Visual-Spatial-Planning

Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for MLLMs"

☆16

Alternatives and similar repositories for Visual-Spatial-Planning

Users interested in Visual-Spatial-Planning are comparing it to the repositories listed below.

UCSB-NLP-Chang / ULD
Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24…
☆26 · Jun 14, 2024 · Updated last year
Manoj4689 / Genetic-algorithms
☆10 · Oct 17, 2023 · Updated 2 years ago
UCSB-NLP-Chang / SemanticSmooth
Implementation of paper 'Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing'
☆22 · Jun 9, 2024 · Updated last year
auspicious3000 / ProsodyLM
ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models
☆34 · Nov 18, 2025 · Updated 3 months ago
IretonLiu / mine-pddl
☆22 · Jun 11, 2024 · Updated last year
S1s-Z / CANOE
[AAAI'26, Oral] Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learni…
☆43 · Jul 16, 2025 · Updated 7 months ago
portal-cornell / robotouille
☆39 · Updated this week
RiTUAL-MBZUAI / Font-prediction-dataset
This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"
☆10 · May 5, 2020 · Updated 5 years ago
MohitShridhar / oculus_gazebo_navigator
Oculus-Rift - Gazebo Navigator (PS3 controller & keyboard op)
☆11 · Oct 9, 2014 · Updated 11 years ago
forestagostinelli / deepxube
Learn a domain-specific heuristic function in a domain-independent fashion to solve pathfinding problems.
☆10 · Updated this week
dsgiitr / Emotion-Recognition
Emotion Recognition
☆10 · Oct 22, 2017 · Updated 8 years ago
OPPO-Mente-Lab / attention-mask-control
code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"
☆46 · Sep 21, 2023 · Updated 2 years ago
bowang-lab / NanoMASK
mouse pet-ct image segmentation
☆12 · Feb 19, 2023 · Updated 3 years ago
OoDBag / VisTA
VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
☆25 · May 31, 2025 · Updated 9 months ago
Mohamedelrefaie / TransonicSurrogate
Surrogate Modeling of the Aerodynamic Performance for Transonic Regime
☆13 · Feb 12, 2024 · Updated 2 years ago
jonnypei / acl23-preadd
☆12 · Jul 25, 2023 · Updated 2 years ago
marlinprotocol / mev-inspect-py
☆12 · Aug 31, 2023 · Updated 2 years ago
marcelbinz / meta-learned-models
☆13 · Mar 21, 2023 · Updated 2 years ago
HKUST-KnowComp / ActPlan-1K
☆10 · Oct 7, 2024 · Updated last year
namhkoh / BAD-BiAs-Detection-in-LLMs
BAD: BiAs Detection for Large Language Models in the context of candidate screening (EECS 692)
☆12 · Feb 14, 2024 · Updated 2 years ago
SmartDataAnalytics / kgirnet
Scripts for KGIRNet model for ESWC
☆10 · Jul 6, 2023 · Updated 2 years ago
UCSB-NLP-Chang / Prereq_tune
Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"
☆11 · Jan 10, 2025 · Updated last year
pi-tau / vae
Pytorch implementation of a Variational Autoencoder trained on CIFAR-10. The encoder and decoder modules are modelled using a resnet-styl…
☆16 · Jan 29, 2024 · Updated 2 years ago
jiacheng-xu / sum-interpret
Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)
☆13 · Jun 2, 2021 · Updated 4 years ago
OpenGVLab / VRBench
[ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos
☆24 · Aug 8, 2025 · Updated 6 months ago
StanfordVL / b1k-baselines
Repo for running various baselines with Behavior-1K
☆33 · Nov 7, 2025 · Updated 3 months ago
30hours / cadquery2web
A browser based CadQuery server
☆12 · Feb 18, 2025 · Updated last year
rleap-project / dlplan
A library for constructing and evaluating state features made up of description logics for planning.
☆12 · Mar 17, 2025 · Updated 11 months ago
thisismy-github / instant-replay-suite
(Obsolete due to Steam's new Game Recording feature) A suite of tools for formatting and editing clips saved with Geforce Experience's "I…
☆14 · Jun 27, 2024 · Updated last year
google / belief-localization
This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…
☆61 · May 9, 2023 · Updated 2 years ago
code-cse / Face-Detection-SSD
This repository contains the code for Face detection using SSD.
☆13 · Jun 10, 2020 · Updated 5 years ago
TIGER-AI-Lab / VideoEval-Pro
More reliable Video Understanding Evaluation
☆14 · Sep 23, 2025 · Updated 5 months ago
stdKonjac / DeepComplexCRN
☆13 · Mar 22, 2021 · Updated 4 years ago
alper111 / DeepSym
Learning effect regulated object categories
☆15 · Nov 4, 2025 · Updated 3 months ago
worv-ai / canvas
CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
☆18 · Oct 20, 2025 · Updated 4 months ago
hkust-nlp / RL-Verifier-Robustness
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
☆25 · Oct 7, 2025 · Updated 4 months ago
shunk031 / allennlp-shiba-model
AllenNLP integration for Shiba: Japanese CANINE model
☆12 · Jun 26, 2021 · Updated 4 years ago
goodrobots / visiond
Python/Gstreamer based project to stream video from embedded system cameras in various ways
☆12 · Sep 20, 2021 · Updated 4 years ago
Vincent-ZHQ / Comprehensive-Long-Video-Understanding-Survey
A survey on MM-LLMs for long video understanding: From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long…
☆18 · Sep 12, 2025 · Updated 5 months ago