AlvinWen428/spatial-relation-benchmark

☆15

Alternatives and similar repositories for spatial-relation-benchmark

Users who are interested in spatial-relation-benchmark are comparing it to the repositories listed below.

ftchirou / Bemol
🎵 Bemol is a free and open-source ear training app that helps music hobbyists and music students train and develop relative pitch.
☆15 · Jul 3, 2026 · Updated 2 weeks ago
EricJin2002 / SOE
[ICRA 2026] SOE: Sample-Efficient Robot Policy Self-Improvement via On-Manifold Exploration
☆18 · Mar 2, 2026 · Updated 4 months ago
uvavision / SyViC
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13 · Sep 30, 2023 · Updated 2 years ago
VincentDENGP / 3D-LR
Can 3D Vision-Language Models Truly Understand Natural Language?
☆20 · Mar 28, 2024 · Updated 2 years ago
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11 · May 24, 2023 · Updated 3 years ago
ys-zong / MIRB
Benchmarking Multi-Image Understanding in Vision and Language Models
☆11 · Jul 29, 2024 · Updated last year
princeton-vl / Rel3D
Official code for NeurIPS 2020 paper "Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D"
☆33 · Dec 12, 2024 · Updated last year
AlvinWen428 / keyframe-focused-imitation-learning
☆11 · Dec 13, 2021 · Updated 4 years ago
hukkai / liresnet
[NeurIPS 2023] and [ICLR 2024] for robustness certification.
☆10 · Nov 30, 2024 · Updated last year
cha15yq / MRC-Crowd
Implementation of "Semi-Supervised Crowd Counting with Contextual Modeling: Facilitating Holistic Understanding of Crowd Scenes"
☆13 · Oct 2, 2024 · Updated last year
martin-xia0 / China_Future_Md
Data gateway system for getting real-time tick data from the Shanghai Futures Exchange
☆13 · Dec 8, 2019 · Updated 6 years ago
joactr / AnnoTheia
AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexib…
☆27 · Jul 26, 2024 · Updated last year
id9502 / Option-GAIL
☆12 · Dec 22, 2021 · Updated 4 years ago
XingruiWang / 3D-Aware-VQA
Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"
☆21 · Oct 17, 2024 · Updated last year
ZzZZCHS / WS-3DVG
[ICCV 2023] Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
☆14 · Oct 2, 2024 · Updated last year
we-chatter / wechatter
wechatter: An easy Conversation AI Chatbot Framework
☆10 · Apr 15, 2021 · Updated 5 years ago
SimonSun0810 / VGGT-World
☆35 · Jul 13, 2026 · Updated last week
HUuxiaobin / DiffuMatting
☆18 · Jul 14, 2025 · Updated last year
VidCapBench / VidCapBench
☆13 · May 17, 2025 · Updated last year
cxliu0 / Noisy-Labels-in-Computer-Vision
A curated list of papers that study learning with noisy labels.
☆21 · Jan 22, 2024 · Updated 2 years ago
Ziyan-Huang / AdwU-Net
MIDL2022 | AdwU-Net: Adaptive depth and width U-Net.
☆21 · Mar 26, 2022 · Updated 4 years ago
meyerls / FruitNeRFpp
[IROS25] Official Code for "FruitNeRF++: A Generalized Multi-Fruit Counting Method Utilizing Contrastive Learning and Neural Radiance Fiel…
☆18 · Dec 14, 2025 · Updated 7 months ago
hany01rye / tiger
TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
☆23 · Nov 18, 2025 · Updated 8 months ago
bshfang / WorldReel
WorldReel: 4D Video Generation with Consistent Geometry and Motion Modeling
☆19 · Apr 11, 2026 · Updated 3 months ago
noelshin / zutis
[CVPRW'23 Best Paper Award] Zero-shot Unsupervised Transfer Instance Segmentation
☆24 · Aug 22, 2023 · Updated 2 years ago
mlevy2525 / P3PO
Code for P3PO
☆20 · Jan 31, 2025 · Updated last year
McGill-NLP / diffusion-itm
Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"
☆33 · Mar 15, 2024 · Updated 2 years ago
KaustubhPatange / Diffuser-layerdiffuse
Unofficial implementation of Layer Diffuse in diffusers
☆28 · Apr 3, 2024 · Updated 2 years ago
federicocunico / human-robot-collaboration
Utilities for visualizing the human poses of CHICO dataset from "Pose Forecasting in Industrial Human-Robot Collaboration" ECCV 2022 pape…
☆11 · Oct 24, 2022 · Updated 3 years ago
msm8976 / NightReID
[AAAI'25 Oral] NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark
☆11 · Jun 10, 2025 · Updated last year
lyrig / TokenAR
TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement
☆22 · Mar 4, 2026 · Updated 4 months ago
xlyu0106 / MACT
☆19 · Jul 31, 2025 · Updated 11 months ago
linzhiqiu / visual_gpt_score
VisualGPTScore for visio-linguistic reasoning
☆27 · Oct 7, 2023 · Updated 2 years ago
KeNiu042 / Diffusion-ReID
Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training
☆11 · Jan 23, 2024 · Updated 2 years ago
CorentinDumery / 3d-counting
[ICCV25 Oral] Counting Stacked Objects
☆25 · Jan 18, 2026 · Updated 6 months ago
Lion-shine / Segment-Membranes-and-Nuclei-from-Histopathological-Images-via-Nuclei-Point-level-Supervision
☆12 · Jul 3, 2023 · Updated 3 years ago
LoraLinH / Semi-supervised-Crowd-Counting-via-Density-Agency
Official implementation of the ACM MM 2022 paper 'Semi-supervised Crowd Counting via Density Agency'
☆24 · Sep 23, 2022 · Updated 3 years ago
StoreBlank / KUDA
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation
☆22 · Apr 23, 2025 · Updated last year
robotnav-bot / NOW
☆16 · Mar 13, 2026 · Updated 4 months ago