vgthengane/Awesome-FMs-in-3D

A comprehensive survey on Multimodal Models in 3D

☆77

Alternatives and similar repositories for Awesome-FMs-in-3D

Users who are interested in Awesome-FMs-in-3D are comparing it to the repositories listed below.

TangYuan96 / MiniGPT-3D
[MM 2024] [Need only a 3090] MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors
☆131 · Mar 20, 2026 · Updated 4 months ago
ZCMax / LLaVA-3D
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆385 · Oct 21, 2025 · Updated 9 months ago
fanglaosi / Point-In-Context
[NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding
☆74 · Mar 18, 2026 · Updated 4 months ago
LHDuan / ConDaFormer
[NeurIPS'23] ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding
☆12 · Dec 9, 2023 · Updated 2 years ago
wangzy22 / XMask3D
[NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
☆37 · Jan 20, 2025 · Updated last year
TyroneLi / CUA_O3D
CVPR2025
☆23 · Aug 16, 2025 · Updated 11 months ago
LMD0311 / PointMamba
[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis
☆547 · Mar 19, 2025 · Updated last year
lmomoy / MAENet
Official implementation of the paper "MAENet: Boost Image-guided Point Cloud Completion More Accurate and Even" (Information Fusion 2025)
☆16 · Jun 4, 2025 · Updated last year
MukundVarmaT / Lift3D
☆34 · Apr 4, 2024 · Updated 2 years ago
Haochen-Wang409 / ross3d
[ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
☆70 · Jul 22, 2025 · Updated last year
marco-garosi / COPS
Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"
☆25 · Jun 8, 2025 · Updated last year
ZCMax / ScanReason
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
☆85 · Oct 10, 2024 · Updated last year
ZhaochongAn / Multimodality-3D-Few-Shot
[ICLR 2025 Spotlight] Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation
☆73 · May 7, 2025 · Updated last year
fudan-zvg / UniUGG
UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding. Accepted to ICLR 2026.
☆63 · Updated this week
IRMVLab / Point-Mamba
Point Mamba
☆134 · May 7, 2024 · Updated 2 years ago
Ivan-Tang-3D / ViewRefer3D
(ICCV2023) Official implementation of 'ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance'…
☆60 · Apr 18, 2024 · Updated 2 years ago
auniquesun / PPT
[ICRA 2024] Official Implementation of the paper "Parameter-efficient Prompt Learning for 3D Point Cloud Understanding"
☆30 · Mar 13, 2026 · Updated 4 months ago
zhengxiaozx / PointDif
☆45 · Nov 1, 2024 · Updated last year
Mr-Neko / JM3D
The official implementation of JM3D.
☆31 · Apr 8, 2026 · Updated 3 months ago
facebookresearch / univlg
Unifying 2D and 3D Vision-Language Understanding
☆126 · Jul 2, 2026 · Updated 2 weeks ago
Hoyyyaard / LSceneLLM
☆74 · Mar 29, 2025 · Updated last year
ltwu6 / cross-pcc
☆27 · May 3, 2024 · Updated 2 years ago
Open3DA / LL3DA
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu…
☆319 · Jul 17, 2024 · Updated 2 years ago
LiuHengyu321 / IR3D-Bench
[NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering
☆46 · Oct 15, 2025 · Updated 9 months ago
a4152684 / KT-Net
KT-Net: Knowledge Transfer for Unpaired 3D Shape Completion (point cloud completion)
☆25 · Oct 20, 2024 · Updated last year
yangyangyang127 / PointCLIP_V2
[ICCV 2023] PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning
☆291 · Aug 12, 2025 · Updated 11 months ago
TangYuan96 / GreenPLM
[AAAI 2025] More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding
☆28 · Mar 20, 2026 · Updated 4 months ago
liziwennba / SURPRISE3D
☆22 · Apr 14, 2026 · Updated 3 months ago
Ivan-Tang-3D / Any2Point
[ECCV2024] Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding
☆127 · Jul 2, 2024 · Updated 2 years ago
heshuting555 / SegPoint
☆38 · Jul 19, 2024 · Updated 2 years ago
GAP-LAB-CUHK-SZ / SAMPro3D
SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Instance Segmentation (3DV 2025)
☆171 · Apr 17, 2025 · Updated last year
InternRobotics / Grounded_3D-LLM
Code&Data for Grounded 3D-LLM with Referent Tokens
☆136 · Jan 5, 2025 · Updated last year
sled-group / 3D-GRAND
[CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
☆54 · Jun 13, 2024 · Updated 2 years ago
IHe-KaiI / CTRL-D
CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion.
☆59 · Apr 28, 2025 · Updated last year
zkytony / graphspn
Graph-Structured Sum-Product Networks (GraphSPN), AAAI'18
☆10 · May 8, 2022 · Updated 4 years ago
InternRobotics / PointLLM
[ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large Language Models to Understand Point Clouds
☆1,038 · May 15, 2026 · Updated 2 months ago
KuanchihHuang / Reason3D
[3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
☆124 · May 30, 2025 · Updated last year
JulesSanchez / 3DLabelProp
☆25 · Nov 6, 2024 · Updated last year
mingukkang / FlashDecoder
Official FlashDecoder Github
☆17 · Apr 4, 2026 · Updated 3 months ago