YunzeMan/Situation3D

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YunzeMan/Situation3D)

YunzeMan / Situation3D

[CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning

☆44

Alternatives and similar repositories for Situation3D

Users that are interested in Situation3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MSR3D / MSR3D
View on GitHub
[NeurIPS 2024] MSR3D: Multimodal Situated Reasoning in 3D Scenes
☆76Dec 2, 2025Updated 7 months ago
AmrinKareem / PARIS3D
View on GitHub
Official implementation of PARIS3D (Accepted to ECCV 2024).
☆27Sep 25, 2024Updated last year
PQ3D / PQ3D
View on GitHub
Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"
☆85Aug 2, 2024Updated last year
KuanchihHuang / Reason3D
View on GitHub
[3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
☆124May 30, 2025Updated last year
YunzeMan / Lexicon3D
View on GitHub
[NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
☆102Feb 2, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ZCMax / ScanReason
View on GitHub
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
☆85Oct 10, 2024Updated last year
sg-3d / sg3d
View on GitHub
☆55Oct 3, 2024Updated last year
ZCMax / LLaVA-3D
View on GitHub
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆387Oct 21, 2025Updated 9 months ago
ziqipang / StreamingForecasting
View on GitHub
[IROS 2023] "Streaming Motion Forecasting for Autonomous Driving"
☆41Oct 2, 2023Updated 2 years ago
ZzZZCHS / Chat-Scene
View on GitHub
[NeurIPS 2024 & TPAMI 2026] Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers
☆216Apr 12, 2026Updated 3 months ago
JasonQSY / AffordanceLLM
View on GitHub
Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"
☆14Oct 18, 2024Updated last year
VisionXLab / SpaCE-10
View on GitHub
[ICLR 2026] SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence
☆20Jan 26, 2026Updated 5 months ago
TangYuan96 / GreenPLM
View on GitHub
[AAAI 2025] More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding
☆28Mar 20, 2026Updated 4 months ago
LaVi-Lab / Video-3D-LLM
View on GitHub
[CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.
☆219Jun 4, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
eslambakr / CoT3D_VG
View on GitHub
Chain_of_Thoughts_3D_Visual_Grounding
☆21Apr 20, 2024Updated 2 years ago
InternRobotics / Grounded_3D-LLM
View on GitHub
Code&Data for Grounded 3D-LLM with Referent Tokens
☆136Jan 5, 2025Updated last year
Jingkang50 / PSG4D
View on GitHub
4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)
☆122Mar 13, 2025Updated last year
SooLab / Part2Object
View on GitHub
[ECCV 2024] The official PyTorch implementation of the "Part2Object: Hierarchical Unsupervised 3D Instance Segmentation".
☆26Sep 12, 2024Updated last year
HanchenTai / OV-SAM3D
View on GitHub
Open-Vocabulary SAM3D: Understand Any 3D Scene
☆44Jun 9, 2025Updated last year
ZQS1943 / GLEN
View on GitHub
code for "GLEN: General-Purpose Event Detection for Thousands of Types"
☆13Nov 6, 2023Updated 2 years ago
ayushjain1144 / odin
View on GitHub
Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)
☆177Feb 27, 2026Updated 4 months ago
sosppxo / RG-SAN
View on GitHub
[NeurIPS 2024 Oral] RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation
☆20Dec 22, 2024Updated last year
lslrh / DMA
View on GitHub
Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024
☆32Jul 18, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
boschresearch / Open3DSG
View on GitHub
[CVPR 2024] Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships
☆166Sep 16, 2024Updated last year
Open3DA / LL3DA
View on GitHub
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu…
☆319Jul 17, 2024Updated 2 years ago
InternRobotics / EmbodiedScan
View on GitHub
[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
☆672Jun 13, 2025Updated last year
scene-verse / SceneVerse
View on GitHub
Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"
☆288Mar 19, 2025Updated last year
zsc000722 / PPT
View on GitHub
☆20Sep 27, 2024Updated last year
evelinehong / 3D-Concept-Grounding
View on GitHub
Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"
☆15Feb 13, 2023Updated 3 years ago
ymxlzgy / echoscene
View on GitHub
[ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.
☆102Jun 3, 2024Updated 2 years ago
Fsoft-AIC / Open-Vocabulary-Affordance-Detection-in-3D-Point-Clouds
View on GitHub
[IROS 2023] Open-Vocabulary Affordance Detection in 3d Point Clouds
☆89Sep 4, 2024Updated last year
cheolhong0916 / contrastive-probing
View on GitHub
☆15Jun 19, 2026Updated last month
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
djiajunustc / 3D-LLaVA
View on GitHub
[CVPR 2025] 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer
☆101May 26, 2025Updated last year
Fsoft-AIC / Language-Conditioned-Affordance-Pose-Detection-in-3D-Point-Clouds
View on GitHub
[ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
☆54Jan 10, 2025Updated last year
SceneFun3D / scenefun3d
View on GitHub
SceneFun3D ToolKit
☆181Apr 17, 2025Updated last year
Jerrypiglet / indoorInverse
View on GitHub
Rui Zhu's implementation of CVPR2020 work Inverse Rendering for Complex Indoor Scene by Li et.al
☆13Jan 17, 2023Updated 3 years ago
tev-fbk / fun3du
View on GitHub
[CVPR25 Highlight] Official implementation of Fun3DU, a method for functional understanding and segmentation in 3D scenes
☆51Sep 30, 2025Updated 9 months ago
CurryYuan / PhraseRefer
View on GitHub
[TNNLS] Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
☆17Jul 10, 2025Updated last year
johnson111788 / SpatialReasoner
View on GitHub
Training recipe for SpatialReasoner [NeurIPS 2025]
☆45Apr 5, 2026Updated 3 months ago