sled-group/3D-GRAND

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sled-group/3D-GRAND)

sled-group / 3D-GRAND

[CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs

☆54

Alternatives and similar repositories for 3D-GRAND

Users that are interested in 3D-GRAND are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenM3D / M3DBench
View on GitHub
[ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.
☆61Oct 1, 2024Updated last year
3dlg-hcvc / multi3drefer
View on GitHub
[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects
☆98Mar 26, 2026Updated 4 months ago
VincentDENGP / 3D-LR
View on GitHub
Can 3D Vision-Language Models Truly Understand Natural Language?
☆20Mar 28, 2024Updated 2 years ago
3d-vista / 3D-VisTA
View on GitHub
Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"
☆215Sep 7, 2023Updated 2 years ago
3dlg-hcvc / r3ds
View on GitHub
Official repository of the paper "R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding"
☆23Dec 2, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
cvlab-columbia / dreamitate
View on GitHub
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)
☆59Jun 7, 2025Updated last year
Open3DA / LL3DA
View on GitHub
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu…
☆319Jul 17, 2024Updated 2 years ago
YunzeMan / Lexicon3D
View on GitHub
[NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
☆102Feb 2, 2025Updated last year
scene-verse / SceneVerse
View on GitHub
Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"
☆288Mar 19, 2025Updated last year
TRI-ML / OctMAE
View on GitHub
Zero-Shot Multi-Object Shape Completion (ECCV 2024)
☆31Apr 1, 2025Updated last year
3dlg-hcvc / smc
View on GitHub
[3DV 2025] Official implementation of the paper "SceneMotifCoder: Example-driven Visual Program Learning for Generating 3D Object Arrange…
☆47Oct 14, 2025Updated 9 months ago
InternRobotics / Grounded_3D-LLM
View on GitHub
Code&Data for Grounded 3D-LLM with Referent Tokens
☆136Jan 5, 2025Updated last year
CASAGPT / CASA-GPT
View on GitHub
PyTorch implementation of the paper: CASAGPT: Cuboid Arrangement and Scene Assembly for Interior Design [CVPR 2025]
☆15Apr 5, 2025Updated last year
ATR-DBI / ScanQA
View on GitHub
☆162Aug 23, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Universal-Control / ppt_learning
View on GitHub
A unified robotic manipulation learning framework
☆24Sep 4, 2025Updated 10 months ago
sled-group / RACER
View on GitHub
[ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning
☆47Oct 10, 2024Updated last year
sled-group / chat-with-nerf
View on GitHub
[ICRA 2024] Chat with NeRF enables users to interact with a NeRF model by typing in natural language.
☆322Oct 10, 2025Updated 9 months ago
embodied-generalist / embodied-generalist
View on GitHub
[ICML 2024] LEO: An Embodied Generalist Agent in 3D World
☆487Apr 20, 2025Updated last year
allenai / objathor
View on GitHub
Python package for importing and loading external assets into AI2THOR
☆34Jan 5, 2026Updated 6 months ago
ZzZZCHS / Chat-Scene
View on GitHub
[NeurIPS 2024 & TPAMI 2026] Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers
☆216Apr 12, 2026Updated 3 months ago
referit3d / referit3d
View on GitHub
Code accompanying our ECCV-2020 paper on 3D Neural Listeners.
☆141Jun 29, 2021Updated 5 years ago
wenhaochai / claude-plugins
View on GitHub
Personal Claude Code plugin marketplace
☆16Jul 21, 2026Updated last week
vlc-robot / robot_sugar
View on GitHub
Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).
☆46Jun 19, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
facebookresearch / agent2sim
View on GitHub
Agent-to-Sim Learning Interactive Behavior from Casual Videos.
☆49Oct 16, 2024Updated last year
lck666666 / Mem3R
View on GitHub
Official implementation of Mem3R: Streaming 3D Reconstruction with Hybrid Memory via Test-Time Training.
☆44Apr 9, 2026Updated 3 months ago
gabriellelittle1 / FlairGPT
View on GitHub
All code for FlairGPT: Repurposing LLMs for Interior Designs, Eurographics 2025
☆21Mar 6, 2025Updated last year
fudan-zvg / UniUGG
View on GitHub
UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding. Accepted to ICLR 2026.
☆63Jul 16, 2026Updated last week
sled-group / moh
View on GitHub
[NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models
☆37Nov 13, 2024Updated last year
JasonQSY / 3DOI
View on GitHub
[ICCV 2023] Understanding 3D Object Interaction from a Single Image
☆47Feb 29, 2024Updated 2 years ago
NVlabs / L4P
View on GitHub
(3DV 2026 Oral) L4P -- a feed-forward foundational model designed for multiple low-level 4D vision perception tasks.
☆72Dec 9, 2025Updated 7 months ago
idejie / 3DSyn
View on GitHub
☆12May 19, 2025Updated last year
UMass-Embodied-AGI / 3D-LLM
View on GitHub
Code for 3D-LLM: Injecting the 3D World into Large Language Models
☆1,211Jun 6, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
dw-dengwei / TreeSearchGen
View on GitHub
[CVPR 2025🔥] Official codebase for "Global-Local Tree Search in VLMs for 3D Indoor Scene Generation" and our arxiv 2026 extension
☆23Jun 5, 2026Updated last month
PQ3D / PQ3D
View on GitHub
Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"
☆86Aug 2, 2024Updated last year
Space3D-Bench / Space3D-Bench
View on GitHub
☆12Apr 18, 2025Updated last year
ZCMax / ScanReason
View on GitHub
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
☆85Oct 10, 2024Updated last year
yinyunie / BlenderProc-3DFront
View on GitHub
Support BlenderProc2 with multi-GPU batch rendering and 3D visualization for 3D-Front
☆141Jul 21, 2023Updated 3 years ago
Kiteretsu77 / This_and_That_VDM
View on GitHub
This is the official implementation of Video Generation part of This&That: Language-Gesture Controlled Video Generation for Robot Plannin…
☆49Dec 19, 2025Updated 7 months ago
meta-scenes / MetaScenes
View on GitHub
☆68Dec 3, 2025Updated 7 months ago