XingruiWang/Spatial457

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/XingruiWang/Spatial457)

XingruiWang / Spatial457

[CVPR'25 Highlight] A VQA benchmark for 6D spatial reasoning.

☆20

Alternatives and similar repositories for Spatial457

Users that are interested in Spatial457 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Beckschen / spatialcode
View on GitHub
Open studio for "Thinking with Spatial Code" (https://arxiv.org/pdf/2603.05591)
☆20Mar 18, 2026Updated 4 months ago
wufeim / LychSim
View on GitHub
A controllable and interactive simulation framework for vision research.
☆16May 25, 2026Updated 2 months ago
XingruiWang / DynSuperCLEVR
View on GitHub
A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions withi…
☆20Apr 23, 2025Updated last year
SarahWeiii / Blender_cv
View on GitHub
This repo provides tutorials and a library to help CV researchers to generate data using blender.
☆15Feb 2, 2020Updated 6 years ago
haoningwu3639 / SpatialScore
View on GitHub
[CVPR 2026 Highlight] SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence
☆84May 28, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Lzq5 / Video-Text-Alignment
View on GitHub
☆28Jul 18, 2025Updated last year
danaesavi / ImageChain
View on GitHub
This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…
☆15Jun 4, 2025Updated last year
InternLM / ETCHR
View on GitHub
A question-conditioned, reasoning-aware image editor designed to serve as a decoupled visual reasoning assistant for Multimodal Large Lan…
☆23May 25, 2026Updated 2 months ago
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Sep 30, 2023Updated 2 years ago
jkli1998 / T-CAR
View on GitHub
Code for paper 'Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction' （TOMM 2023）
☆10Sep 6, 2025Updated 10 months ago
arubique / OCCAM
View on GitHub
This is an implementation of the paper "Are We Done with Object-Centric Learning?"
☆14Jun 21, 2026Updated last month
JiahaoPlus / EvoWorld
View on GitHub
EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory
☆71Jan 13, 2026Updated 6 months ago
Lizw14 / Super-CLEVR
View on GitHub
Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"
☆47Feb 19, 2026Updated 5 months ago
beichenzbc / BoostStep
View on GitHub
official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆37Jan 21, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
HighwayWu / ST-DDL
View on GitHub
☆16Feb 27, 2025Updated last year
ahclab / turntaking
View on GitHub
☆13Feb 16, 2024Updated 2 years ago
qizekun / OmniSpatial
View on GitHub
[ICLR 2026] OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
☆89Jan 21, 2026Updated 6 months ago
scottgeng00 / realtalk
View on GitHub
The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."
☆15Jan 27, 2023Updated 3 years ago
ArchiMickey / ArchiRF
View on GitHub
☆10Sep 21, 2024Updated last year
PoCInnovation / Deep-PoC
View on GitHub
Deep-PoC is a deepFake detection tool designed to detect deepfakes from videos or images using artificial intelligence.
☆14Sep 23, 2021Updated 4 years ago
sigmike / vncthumbnailviewer
View on GitHub
Viewer for Observing Multiple Computers using VNC
☆19Feb 13, 2010Updated 16 years ago
InternLM / Spatial-SSRL
View on GitHub
[CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"
☆133Apr 7, 2026Updated 3 months ago
OmniMMI / M4
View on GitHub
[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
☆19Apr 2, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
nitin-ppnp / SMPL.jl
View on GitHub
Julia implementation of SMPL family of 3D human models.
☆16Jun 17, 2026Updated last month
YixiangChen515 / FlowWAM
View on GitHub
Official repo of "FlowWAM: Optical Flow as a Unified Action Representation for World Action Models"
☆36Jul 15, 2026Updated 2 weeks ago
3DCoMPaT200 / 3DCoMPaT200
View on GitHub
☆15Feb 13, 2025Updated last year
RobertCsordas / silent-xps
View on GitHub
Turns off Dell XPS 15 fan when temperatures are low enough.
☆12Jun 4, 2019Updated 7 years ago
CaiHaozhong / laplacian-mesh-editing
View on GitHub
Implementation of Laplacian Surface Editing from O. Sorkine and D. Cohen-Or.
☆20Jun 20, 2019Updated 7 years ago
clf28 / x-flux-ip-adapter
View on GitHub
The IP-Adapter training scripts and inference for Flux Model, which is implemented based on X-Lab
☆17Oct 1, 2024Updated last year
junting / odas
View on GitHub
Online Detection of Action Start in Untrimmed, Streaming Videos
☆12Sep 1, 2018Updated 7 years ago
martius-lab / cluster_utils
View on GitHub
☆14Apr 15, 2025Updated last year
ZJU-REAL / CoVerRL
View on GitHub
[ACL 2026 main] CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution
☆27Apr 18, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Beckschen / ViTamin
View on GitHub
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
☆211Jun 9, 2024Updated 2 years ago
hibetterheyj / v4r-plot
View on GitHub
For a better scientific drawing pipeline in MATLAB 😀 or Python
☆16May 30, 2021Updated 5 years ago
manga109 / public-annotations
View on GitHub
Various annotations of Manga109 dataset
☆13Apr 23, 2025Updated last year
collinskatie / awesome-inverse-graphics
View on GitHub
Curated list of papers and resources related to inverse graphics!
☆15Mar 27, 2021Updated 5 years ago
ShaoqLin / DiscoSG
View on GitHub
[EMNLP 2025 Outstanding Paper Award] Official repo for DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph …
☆22Nov 16, 2025Updated 8 months ago
marthaflinderslewis / clip-binding
View on GitHub
Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.
☆16Oct 14, 2023Updated 2 years ago
InternLM / StarBench
View on GitHub
[ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"
☆42Apr 19, 2026Updated 3 months ago