djiajunustc/3D-LLaVA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/djiajunustc/3D-LLaVA)

djiajunustc / 3D-LLaVA

[CVPR 2025] 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer

☆101

Alternatives and similar repositories for 3D-LLaVA

Users that are interested in 3D-LLaVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LaVi-Lab / Video-3D-LLM
View on GitHub
[CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.
☆220Jun 4, 2025Updated last year
Hoyyyaard / LSceneLLM
View on GitHub
☆75Mar 29, 2025Updated last year
KuanchihHuang / Reason3D
View on GitHub
[3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
☆124May 30, 2025Updated last year
LaVi-Lab / VG-LLM
View on GitHub
The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'
☆248Nov 28, 2025Updated 8 months ago
Haochen-Wang409 / ross3d
View on GitHub
[ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
☆70Jul 22, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ZCMax / LLaVA-3D
View on GitHub
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆387Oct 21, 2025Updated 9 months ago
THU-SI / Spatial-MLLM
View on GitHub
[NeurIPS 2025 Spotlight] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
☆480Feb 5, 2026Updated 5 months ago
zoezheng126 / Spatio-Temporal-LLM
View on GitHub
☆19Aug 7, 2025Updated 11 months ago
hanxunyu / Inst3D-LMM
View on GitHub
[CVPR 2025 Highlight] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning…
☆133Jan 30, 2026Updated 5 months ago
3DLLM-Mem / 3DLLM-Mem
View on GitHub
☆27Jun 5, 2025Updated last year
Visual-AI / 3DRS
View on GitHub
[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding
☆158Dec 9, 2025Updated 7 months ago
AIGeeksGroup / 3D-R1
View on GitHub
3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding
☆414Jul 20, 2026Updated last week
sosppxo / RG-SAN
View on GitHub
[NeurIPS 2024 Oral] RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation
☆20Dec 22, 2024Updated last year
heshuting555 / RefMask3D
View on GitHub
[ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation
☆65Jul 29, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
InternRobotics / Grounded_3D-LLM
View on GitHub
Code&Data for Grounded 3D-LLM with Referent Tokens
☆136Jan 5, 2025Updated last year
MatchLab-Imperial / Hypo3D
View on GitHub
ICML 2025 Hypo3D: Exploring Hypothetical Reasoning in 3D
☆46Jul 17, 2025Updated last year
sg-3d / sg3d
View on GitHub
☆55Oct 3, 2024Updated last year
YunzeMan / Situation3D
View on GitHub
[CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning
☆44Dec 9, 2024Updated last year
facebookresearch / univlg
View on GitHub
Unifying 2D and 3D Vision-Language Understanding
☆127Jul 2, 2026Updated 3 weeks ago
Yangr116 / VST
View on GitHub
[ECCV2026] Visual Spatial Tuning
☆200Mar 25, 2026Updated 4 months ago
3dlg-hcvc / vigil3d
View on GitHub
ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding
☆20Aug 8, 2025Updated 11 months ago
LogosRoboticsGroup / SPAR
View on GitHub
From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…
☆90Jan 5, 2026Updated 6 months ago
cxmomo / GraspCoT
View on GitHub
[ICCV 2025] GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions
☆23Mar 23, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
fudan-zvg / UniUGG
View on GitHub
UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding. Accepted to ICLR 2026.
☆63Jul 16, 2026Updated last week
PeiwenSun2000 / SpaceVista
View on GitHub
The official repo for SpaceVista: All-Scale Visual Spatial Reasoning from mm to km.
☆43May 26, 2026Updated 2 months ago
Open3DA / LL3DA
View on GitHub
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu…
☆319Jul 17, 2024Updated 2 years ago
qzp2018 / MCLN
View on GitHub
This is a PyTorch implementation of MCLN proposed by our paper "Multi-branch Collaborative Learning Network for 3D Visual Grounding"(ECCV…
☆27Oct 10, 2024Updated last year
hyzhouboy / LLaVA-4D
View on GitHub
A general large multimodal model for 4D scene understanding
☆16Jul 31, 2025Updated 11 months ago
ActiveVisionLab / Awesome-LLM-3D
View on GitHub
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
☆2,242Apr 16, 2026Updated 3 months ago
liudaizong / Awesome-3D-Visual-Grounding
View on GitHub
😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.
☆283Jan 14, 2026Updated 6 months ago
lifuguan / GP-NeRF
View on GitHub
[CVPR 2024 Highlight] GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding
☆28Jul 26, 2024Updated 2 years ago
dk-liang / UniSeg3D
View on GitHub
[NeurIPS 2024] A Unified Framework for 3D Scene Understanding
☆179Jul 7, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ZzZZCHS / Chat-Scene
View on GitHub
[NeurIPS 2024 & TPAMI 2026] Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers
☆216Apr 12, 2026Updated 3 months ago
WU-CVGL / GS-Reasoner
View on GitHub
Reasoning in Space via Grounding in the World (ICLR 2025)
☆56Nov 3, 2025Updated 8 months ago
VITA-Group / VLM-3R
View on GitHub
[CVPR 2026] VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
☆431Jul 15, 2026Updated 2 weeks ago
ZCMax / ScanReason
View on GitHub
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
☆85Oct 10, 2024Updated last year
lifuguan / LangSurf
View on GitHub
[Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding
☆44Aug 18, 2025Updated 11 months ago
zhoujiahuan1991 / ICML2025-GAPrompt
View on GitHub
Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025
☆17Dec 25, 2025Updated 7 months ago
MTU3D / MTU3D
View on GitHub
☆266Aug 6, 2025Updated 11 months ago