WeitaiKang/Intent3D

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WeitaiKang/Intent3D)

WeitaiKang / Intent3D

[ICLR 2025] Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention

☆29

Alternatives and similar repositories for Intent3D

Users that are interested in Intent3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WeitaiKang / Robin3D
View on GitHub
[ICCV 2025] Improving 3D Large Language Model via Robust Instruction Tuning
☆71Oct 19, 2025Updated 9 months ago
facebookresearch / univlg
View on GitHub
Unifying 2D and 3D Vision-Language Understanding
☆126Jul 2, 2026Updated 3 weeks ago
qzp2018 / MCLN
View on GitHub
This is a PyTorch implementation of MCLN proposed by our paper "Multi-branch Collaborative Learning Network for 3D Visual Grounding"(ECCV…
☆27Oct 10, 2024Updated last year
WHB139426 / TAB-Agent
View on GitHub
Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding
☆26Apr 5, 2026Updated 3 months ago
gabriellelittle1 / FlairGPT
View on GitHub
All code for FlairGPT: Repurposing LLMs for Interior Designs, Eurographics 2025
☆21Mar 6, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
nickgkan / butd_detr
View on GitHub
Code for the ECCV22 paper "Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds"
☆95Jun 9, 2023Updated 3 years ago
JiawLin / SeqVLM
View on GitHub
[ACMMM 2025] Official implementation of SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero Shot 3D Visual Grounding
☆24Nov 25, 2025Updated 8 months ago
KuanchihHuang / Reason3D
View on GitHub
[3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
☆124May 30, 2025Updated last year
liudaizong / Awesome-3D-Visual-Grounding
View on GitHub
😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.
☆283Jan 14, 2026Updated 6 months ago
iris0329 / SeeGround
View on GitHub
[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
☆222Apr 21, 2025Updated last year
PQ3D / PQ3D
View on GitHub
Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"
☆86Aug 2, 2024Updated last year
yangzhifeio / MMGDreamer
View on GitHub
[AAAI 2025]MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation
☆35Jul 26, 2025Updated last year
ZzZZCHS / Chat-Scene
View on GitHub
[NeurIPS 2024 & TPAMI 2026] Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers
☆216Apr 12, 2026Updated 3 months ago
3DSSG / 3DSSG.github.io
View on GitHub
☆18May 7, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ZzZZCHS / WS-3DVG
View on GitHub
[ICCV 2023] Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
☆14Oct 2, 2024Updated last year
ZhenyangLiu / ReasonGrounder
View on GitHub
☆15Jul 11, 2025Updated last year
InternRobotics / Grounded_3D-LLM
View on GitHub
Code&Data for Grounded 3D-LLM with Referent Tokens
☆136Jan 5, 2025Updated last year
heng-hw / SpaCap3D
View on GitHub
[IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)
☆21Aug 31, 2022Updated 3 years ago
ZCMax / LLaVA-3D
View on GitHub
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆387Oct 21, 2025Updated 9 months ago
W-Ted / N3D-VLM
View on GitHub
Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models
☆117Jan 14, 2026Updated 6 months ago
yanmin-wu / EDA
View on GitHub
[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
☆135Oct 11, 2023Updated 2 years ago
SparklingH / BloomScene
View on GitHub
BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation (AAAI 2025)
☆19Jan 13, 2025Updated last year
WeitaiKang / SegVG
View on GitHub
[ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding
☆63Oct 22, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
SJTU-DENG-Lab / R1-Zero-VSI
View on GitHub
☆42Jun 9, 2025Updated last year
AIGeeksGroup / 3D-R1
View on GitHub
3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding
☆414Updated this week
LZ-CH / DSPNet
View on GitHub
The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering
☆28Apr 18, 2025Updated last year
Visual-AI / 3DRS
View on GitHub
[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding
☆158Dec 9, 2025Updated 7 months ago
jinpeng0528 / SEFE
View on GitHub
☆13May 6, 2025Updated last year
yanghan-yh / MCA-Ctrl
View on GitHub
CVPR2025-Multi-party Collaborative Attention Control for Image Customization
☆17May 14, 2025Updated last year
getterupper / PreWorld
View on GitHub
[ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving
☆56Feb 14, 2025Updated last year
AdaptVision / AdaptVision
View on GitHub
[CVPR 2026] AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition
☆40Apr 27, 2026Updated 2 months ago
CognitiveAISystems / 3DGraphLLM
View on GitHub
[ICCV 2025] 3DGraphLLM is a model that uses a 3D scene graph and an LLM to perform 3D vision-language tasks.
☆123Mar 23, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
li-xirong / video-retrieval
View on GitHub
Deep Learning for Video Retrieval by Natural Language
☆11Oct 20, 2019Updated 6 years ago
pauljaffe / lumos-ncpt-tools
View on GitHub
☆14Oct 25, 2022Updated 3 years ago
GWxuan / TSP3D
View on GitHub
[CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
☆252Jun 11, 2025Updated last year
bin123apple / InfantAgent
View on GitHub
[NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.
☆39Apr 23, 2026Updated 3 months ago
ZCMax / ScanReason
View on GitHub
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
☆85Oct 10, 2024Updated last year
sfu-gruvi-3dv / s_squeezer
View on GitHub
☆14Jun 22, 2022Updated 4 years ago
pzhren / InfiniteWorld
View on GitHub
☆86Jun 16, 2026Updated last month