AIGeeksGroup/3D-R1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AIGeeksGroup/3D-R1)

AIGeeksGroup / 3D-R1

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

☆414

Alternatives and similar repositories for 3D-R1

Users that are interested in 3D-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

VITA-Group / VLM-3R
View on GitHub
[CVPR 2026] VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
☆428Updated this week
THU-SI / LangScene-X
View on GitHub
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
☆302Jul 15, 2025Updated last year
THU-SI / Spatial-MLLM
View on GitHub
[NeurIPS 2025 Spotlight] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
☆479Feb 5, 2026Updated 5 months ago
LaVi-Lab / VG-LLM
View on GitHub
The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'
☆245Nov 28, 2025Updated 7 months ago
Visual-AI / 3DRS
View on GitHub
[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding
☆158Dec 9, 2025Updated 7 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
fudan-zvg / UniUGG
View on GitHub
UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding. Accepted to ICLR 2026.
☆63Updated this week
ZCMax / LLaVA-3D
View on GitHub
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆384Oct 21, 2025Updated 8 months ago
AIGeeksGroup / Nav-R1
View on GitHub
Nav-R1: Reasoning and Navigation in Embodied Scenes
☆128Oct 31, 2025Updated 8 months ago
W-Ted / N3D-VLM
View on GitHub
Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models
☆116Jan 14, 2026Updated 6 months ago
Yangr116 / VST
View on GitHub
[ECCV2026] Visual Spatial Tuning
☆198Mar 25, 2026Updated 3 months ago
JinLi998 / CanonObjaverseDataset
View on GitHub
One-shot 3D Object Canonicalization based on Geometric and Semantic Consistency（CVPR highlight 2025）
☆76Dec 15, 2025Updated 7 months ago
yyfz / Pi3
View on GitHub
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
☆2,072Jul 3, 2026Updated 2 weeks ago
facebookresearch / locate-3d
View on GitHub
Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset
☆453Jun 3, 2025Updated last year
nianticlabs / placeit3d
View on GitHub
[ICCV 2025] PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes
☆63Oct 3, 2025Updated 9 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
YkiWu / Point3R
View on GitHub
[NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
☆191Mar 10, 2026Updated 4 months ago
wzzheng / StreamVGGT
View on GitHub
[ICLR 2026] Streaming 4D Visual Geometry Transformer
☆941Oct 27, 2025Updated 8 months ago
djiajunustc / 3D-LLaVA
View on GitHub
[CVPR 2025] 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer
☆100May 26, 2025Updated last year
nv-tlabs / PartField
View on GitHub
[ICCV 2025] PartField: Learning 3D Feature Fields for Part Segmentation and Beyond
☆441Jun 2, 2026Updated last month
jhkim0759 / FastMesh
View on GitHub
[3DV 2026] FastMesh: Efficient Artistic Mesh Generation via Component Decoupling
☆136Nov 11, 2025Updated 8 months ago
DengKaiCQ / VGGT-Long
View on GitHub
Official implement of VGGT-Long
☆882Mar 20, 2026Updated 4 months ago
NIRVANALAN / STream3R
View on GitHub
Dynamic 3D Foundation Model using Causal Transformer. [ICLR 2026]
☆392May 8, 2026Updated 2 months ago
liruilong940607 / prope
View on GitHub
Cameras as Relative Positional Encoding
☆739Dec 18, 2025Updated 7 months ago
NVlabs / EdgeRunner
View on GitHub
[ICLR 2025] EdgeRunner: Auto-regressive Auto-encoder for Efficient Mesh Generation
☆311Dec 22, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
OuyangKun10 / SpaceR
View on GitHub
SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning
☆111Jul 9, 2025Updated last year
ziangcao0312 / PhysX-3D
View on GitHub
PhysX: Physical-Grounded 3D Asset Generation (NeurIPS 2025, Spotlight)
☆380Dec 18, 2025Updated 7 months ago
G-1nOnly / Dens3R
View on GitHub
[ICLR2026] Official Implementation of "Dens3R: A Foundation Model for 3D Geometry Prediction"
☆395May 14, 2026Updated 2 months ago
AiEson / Part-X-MLLM
View on GitHub
[ICLR 26] Part-X-MLLM: Part-aware 3D Multimodal Large Language Model
☆118Jun 17, 2026Updated last month
facebookresearch / 4DGT
View on GitHub
[NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"
☆466Sep 19, 2025Updated 10 months ago
manycore-research / SpatialGen
View on GitHub
[3DV 2026] SpatialGen: Layout-guided 3D Indoor Scene Generation
☆402Apr 18, 2026Updated 3 months ago
ethz-vlg / mvtracker
View on GitHub
[ICCV 2025 Oral] MVTracker: Multi-view 3D Point Tracking
☆511Nov 3, 2025Updated 8 months ago
WU-CVGL / GS-Reasoner
View on GitHub
Reasoning in Space via Grounding in the World (ICLR 2025)
☆56Nov 3, 2025Updated 8 months ago
NJU-3DV / SpatialVID
View on GitHub
[CVPR 2026] SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
☆585Apr 22, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WU-CVGL / SIU3R
View on GitHub
[NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…
☆163Sep 25, 2025Updated 9 months ago
SunYangtian / UniGeo
View on GitHub
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
☆136Jun 10, 2025Updated last year
facebookresearch / sonata
View on GitHub
[CVPR'25 Highlight] Official repository of Sonata: Self-Supervised Learning of Reliable Point Representations
☆764Jun 4, 2025Updated last year
baaivision / Uni3D
View on GitHub
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
☆676Jan 12, 2026Updated 6 months ago
unique1i / SceneSplat
View on GitHub
[ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining
☆354May 25, 2026Updated last month
Davidyao99 / uni4d
View on GitHub
[CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
☆225May 25, 2025Updated last year
facebookresearch / map-anything
View on GitHub
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
☆3,569Updated this week