Adonis-galaxy / DepthCLIP
Official implementation of "Can Language Understand Depth?"
☆83Updated 3 years ago
Alternatives and similar repositories for DepthCLIP
Users who are interested in DepthCLIP are comparing it to the repositories listed below.
- ☆110Updated 2 years ago
- Official implementation of "WorDepth: Variational Language Prior for Monocular Depth Estimation"☆44Updated 11 months ago
- [Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh…☆63Updated 2 years ago
- [ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and…☆34Updated last year
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆72Updated last year
- The official implementation of JM3D.☆31Updated 4 months ago
- Repository of Trans4PASS (accepted to CVPR2022)☆95Updated 2 years ago
- (AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models☆57Updated last year
- Masked Surfel Prediction for Self-Supervised Point Cloud Learning☆27Updated 2 years ago
- ☆37Updated 2 years ago
- [ECCV 2022] Masked Discrimination for Self-Supervised Learning on Point Clouds☆95Updated 3 years ago
- [ICCV 2023] CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training☆127Updated last year
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆31Updated 3 years ago
- ☆21Updated 8 months ago
- [NeurIPS 2022 Spotlight] P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting☆131Updated 2 years ago
- ☆74Updated 11 months ago
- ☆58Updated last year
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆122Updated last year
- [CVPR 2022] Multi-View Transformer for 3D Visual Grounding☆78Updated 3 years ago
- [ICCV 2023] The PyTorch implementation of TL-Align: Token-Label Alignment for Vision Transformers.☆23Updated 2 years ago
- ☆31Updated 2 years ago
- (ICCV2023) IST-Net: Prior-free Category-level Pose Estimation with Implicit Space Transformation☆118Updated 2 years ago
- [CVPR'24] Neural Clustering based Visual Representation Learning☆45Updated 3 months ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Updated last year
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆36Updated last year
- ☆49Updated 2 years ago
- [AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Refer…☆44Updated 2 years ago
- Toolkit for VIPER benchmark☆15Updated 5 years ago
- ☆19Updated last year
- (ECCV 2022) DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation☆49Updated 3 years ago