dk-liang/UniSeg3D

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dk-liang/UniSeg3D)

dk-liang / UniSeg3D

[NeurIPS 2024] A Unified Framework for 3D Scene Understanding

☆179

Alternatives and similar repositories for UniSeg3D

Users that are interested in UniSeg3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DYZhang09 / ToC3D
View on GitHub
[ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
☆53Sep 21, 2024Updated last year
XenoZLH / Shuffle-R1
View on GitHub
Official code repository of Shuffle-R1
☆26Feb 23, 2026Updated 5 months ago
PQ3D / PQ3D
View on GitHub
Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"
☆85Aug 2, 2024Updated last year
DYZhang09 / ViTWSS3D
View on GitHub
[ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection
☆13Apr 12, 2024Updated 2 years ago
LMD0311 / PointMamba
View on GitHub
[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis
☆547Mar 19, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Pointcept / OpenIns3D
View on GitHub
[ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
☆207Oct 19, 2024Updated last year
zc-zhao / DriveMonkey
View on GitHub
the official code of DriveMonkey
☆45Mar 20, 2026Updated 4 months ago
H-EmbodVis / PointTPA
View on GitHub
[CVPR 2026] PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding
☆33Apr 7, 2026Updated 3 months ago
Wang-pengfei / GGSD
View on GitHub
Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024
☆31Jul 19, 2024Updated 2 years ago
VinAIResearch / Open3DIS
View on GitHub
Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)
☆135Nov 12, 2024Updated last year
CVRP-SOLE / SOLE
View on GitHub
[ICLR 2025] Official code of "Segment any 3D Object with Language"
☆73Apr 14, 2026Updated 3 months ago
heshuting555 / RefMask3D
View on GitHub
[ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation
☆65Jul 29, 2024Updated last year
scene-verse / SceneVerse
View on GitHub
Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"
☆288Mar 19, 2025Updated last year
1ranGuan / VST
View on GitHub
[ECCV 26] Video Streaming Thinking
☆116Jun 18, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
H-EmbodVis / NUMINA
View on GitHub
[CVPR 2026] When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
☆68Apr 11, 2026Updated 3 months ago
InternRobotics / Grounded_3D-LLM
View on GitHub
Code&Data for Grounded 3D-LLM with Referent Tokens
☆136Jan 5, 2025Updated last year
yd-yin / SAI3D
View on GitHub
[CVPR 2024] SAI3D: Segment Any Instance in 3D Scenes
☆162Mar 29, 2024Updated 2 years ago
H-EmbodVis / HERMESV2
View on GitHub
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation
☆65May 1, 2026Updated 2 months ago
sg-3d / sg3d
View on GitHub
☆55Oct 3, 2024Updated last year
xuxw98 / ESAM
View on GitHub
[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time
☆634May 7, 2025Updated last year
lslrh / DMA
View on GitHub
Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024
☆32Jul 18, 2024Updated 2 years ago
Open3DA / LL3DA
View on GitHub
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu…
☆319Jul 17, 2024Updated 2 years ago
ZCMax / LLaVA-3D
View on GitHub
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆387Oct 21, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
KuanchihHuang / Reason3D
View on GitHub
[3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
☆124May 30, 2025Updated last year
filaPro / oneformer3d
View on GitHub
[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation
☆605Oct 23, 2024Updated last year
CVMI-Lab / PLA
View on GitHub
(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learn…
☆301Jun 28, 2024Updated 2 years ago
LMD0311 / HERMES
View on GitHub
[ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
☆259May 12, 2026Updated 2 months ago
ZiyuGuo99 / SAM2Point
View on GitHub
The Most Faithful Implementation of Segment Anything (SAM) in 3D
☆359Sep 11, 2024Updated last year
dk-liang / UniFuture
View on GitHub
[ICRA 2026] UniFuture: A 4D Driving World Model for Future Generation and Perception
☆163Feb 26, 2026Updated 4 months ago
HanchenTai / OV-SAM3D
View on GitHub
Open-Vocabulary SAM3D: Understand Any 3D Scene
☆44Jun 9, 2025Updated last year
PKU-EPIC / MaskClustering
View on GitHub
[CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation
☆129Apr 25, 2024Updated 2 years ago
sosppxo / MDIN
View on GitHub
[MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentation
☆43Dec 15, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ZCMax / ScanReason
View on GitHub
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
☆85Oct 10, 2024Updated last year
H-EmbodVis / MERGE
View on GitHub
[NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models
☆219Oct 31, 2025Updated 8 months ago
LMD0311 / DAPT
View on GitHub
[CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis
☆171Oct 11, 2024Updated last year
ayushjain1144 / odin
View on GitHub
Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)
☆177Feb 27, 2026Updated 4 months ago
Adlith / MoE-Jetpack
View on GitHub
[NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks
☆137Nov 23, 2024Updated last year
wangzy22 / XMask3D
View on GitHub
[NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
☆37Jan 20, 2025Updated last year
H-EmbodVis / HyDRA
View on GitHub
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
☆267Updated this week