HongkLin / TIDE
[CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes
β21Updated last week
Alternatives and similar repositories for TIDE:
Users that are interested in TIDE are comparing it to the libraries listed below
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024β29Updated 9 months ago
- [CVPR 2025 Highlightπ₯] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuniβ¦β69Updated last week
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Modelβ73Updated last week
- [ICLR 2025] Official code of "Segment any 3D Object with Language"β43Updated 2 months ago
- [MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentationβ36Updated 4 months ago
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentationβ32Updated 3 months ago
- [NeurIPS 2024] A Unified Framework for 3D Scene Understandingβ136Updated 4 months ago
- β26Updated 3 weeks ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compressionβ41Updated 7 months ago
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024β28Updated 9 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Sceneβ27Updated 7 months ago
- β36Updated 9 months ago
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)β77Updated 8 months ago
- β20Updated 2 weeks ago
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detectionβ12Updated last year
- Official repository for paper "Open Panoramic Segmentation" (OPS), ECCV 2024β27Updated 3 weeks ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Drivingβ26Updated 2 months ago
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splattingβ15Updated last month
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenesβ56Updated 6 months ago
- [AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024β56Updated last year
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understandingβ129Updated 3 weeks ago
- β45Updated 3 months ago
- β38Updated 9 months ago
- β46Updated 4 months ago
- (AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Modelsβ51Updated 11 months ago
- Official implementation of "Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness".β18Updated 2 weeks ago
- PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Modelsβ43Updated 3 months ago
- [ECCV 2024] Official implementation of "RangeLDM: Fast Realistic LiDAR Point Cloud Generation"β32Updated 4 months ago
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"β14Updated 9 months ago
- [CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Groundingβ120Updated last year