hanxunyu/Inst3D-LMM

[CVPR 2025 Highlight] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning"

☆133

Alternatives and similar repositories for Inst3D-LMM

Users who are interested in Inst3D-LMM are comparing it to the repositories listed below.

hanxunyu / DepthVLM
🔥 Official code repository for "Unlocking Dense Metric Depth Estimation in VLMs"
☆155 · Updated this week
songw-zju / Scribble2Scene
The official implementation of "Label-efficient Semantic Scene Completion with Scribble Annotations" (IJCAI 2024)
☆14 · Jul 27, 2024 · Updated 2 years ago
hanxunyu / Stream3D-VLM
[ECCV 2026🔥] Official code repository for "Stream3D-VLM: Online 3D Spatial Understanding with Incremental Geometry Priors"
☆47 · Jun 23, 2026 · Updated last month
alibaba-damo-academy / EOCBench
[NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocen…
☆22 · Jun 17, 2025 · Updated last year
songw-zju / HASSC
The official implementation of "Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation" (CVPR 2024)
☆28 · Jul 27, 2024 · Updated 2 years ago
Allenxinn / DecoVLN
[CVPR 2026] Official code repository for "DecoVLN: Decoupling Observation, Reasoning, and Correction for Vision-and-Language Navigation…
☆22 · Mar 19, 2026 · Updated 4 months ago
hanxunyu / VisionTrim
[ICLR 2026] Official code repository for "⚡️VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration"
☆56 · Jun 17, 2026 · Updated last month
CircleRadon / TokenPacker
The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025
☆280 · May 26, 2025 · Updated last year
CircleRadon / APro
The code for "Label-efficient Segmentation via Affinity Propagation". [NeurIPS2023]
☆67 · Mar 4, 2024 · Updated 2 years ago
alibaba-damo-academy / PixelRefer
The code for PixelRefer & VideoRefer
☆352 · Nov 16, 2025 · Updated 8 months ago
agnJason / FMHR
Fine-grained Multi-view Hand Reconstruction Using Inverse Rendering
☆11 · Jul 8, 2024 · Updated 2 years ago
zhoujiahuan1991 / ICML2025-GAPrompt
Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025
☆17 · Dec 25, 2025 · Updated 7 months ago
xiaolul2 / MGMap
[CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction"
☆119 · Apr 13, 2024 · Updated 2 years ago
DCDmllm / Awesome-Object-Centric-LMMs
☆49 · Apr 14, 2026 · Updated 3 months ago
agnJason / XHand
Official pytorch implementation of "XHand: Real-time Expressive Hand Avatar"
☆86 · Jul 31, 2024 · Updated last year
songw-zju / LiDAR2Map
The official implementation of "LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation" (CVPR 20…
☆92 · Apr 6, 2024 · Updated 2 years ago
facebookresearch / univlg
Unifying 2D and 3D Vision-Language Understanding
☆127 · Jul 2, 2026 · Updated 3 weeks ago
DCDmllm / InstructSAM
The code for "InstructSAM: Segment Any Instance with Any Instructions"
☆96 · Updated this week
lslrh / DMA
Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024
☆32 · Jul 18, 2024 · Updated 2 years ago
Hoyyyaard / LSceneLLM
☆75 · Mar 29, 2025 · Updated last year
agnJason / PianoMotion10M
Code release for PianoMotion10M
☆117 · Mar 28, 2025 · Updated last year
xiaolul2 / UIGenMap
[CVPR2025] The code for "Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction."
☆23 · Oct 19, 2025 · Updated 9 months ago
ZCMax / ScanReason
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
☆85 · Oct 10, 2024 · Updated last year
Haochen-Wang409 / ross3d
[ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
☆70 · Jul 22, 2025 · Updated last year
djiajunustc / 3D-LLaVA
[CVPR 2025] 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer
☆101 · May 26, 2025 · Updated last year
fudan-zvg / UniUGG
UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding. Accepted to ICLR 2026.
☆63 · Jul 16, 2026 · Updated last week
xiaolul2 / DynFlowDrive
Code implementation of DynFlowDrive: Flow-Based Dynamic World Modeling for Autonomous Driving
☆24 · Mar 23, 2026 · Updated 4 months ago
Visual-AI / 3DRS
[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding
☆158 · Dec 9, 2025 · Updated 7 months ago
CircleRadon / Osprey
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
☆844 · Aug 19, 2025 · Updated 11 months ago
3dlg-hcvc / multi3drefer
[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects
☆98 · Mar 26, 2026 · Updated 4 months ago
WentingXu3o3 / TB-HSU
[AAAI 2025] Official data and code for "TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances"
☆15 · Sep 11, 2025 · Updated 10 months ago
ZCMax / LLaVA-3D
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆387 · Oct 21, 2025 · Updated 9 months ago
KuanchihHuang / Reason3D
[3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
☆124 · May 30, 2025 · Updated last year
LaVi-Lab / Video-3D-LLM
[CVPR 2025] The code for paper "Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding".
☆220 · Jun 4, 2025 · Updated last year
xiaolul2 / Interp3D
[ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."
☆32 · Jan 21, 2026 · Updated 6 months ago
LiWentomng / Box-supervised-instance-segmentation
Awesome box-supervised instance segmentation papers.
☆79 · Sep 19, 2023 · Updated 2 years ago
ZzZZCHS / Chat-Scene
[NeurIPS 2024 & TPAMI 2026] Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers
☆216 · Apr 12, 2026 · Updated 3 months ago
zijinxuxu / PDFNet
RGB-D fusion for two-hand reconstruction
☆29 · Aug 6, 2024 · Updated last year
Allenxinn / AgentVLN
☆176 · Jul 2, 2026 · Updated 3 weeks ago