Adonis-galaxy / DepthCLIPLinks
Official implementation of "Can Language Understand Depth?"
☆82Updated 2 years ago
Alternatives and similar repositories for DepthCLIP
Users that are interested in DepthCLIP are comparing it to the libraries listed below
Sorting:
- ☆106Updated 2 years ago
- Official implementation of "WorDepth: Variational Language Prior for Monocular Depth Estimation"☆41Updated 6 months ago
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆70Updated 9 months ago
- [Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh…☆62Updated 2 years ago
- Repository of Trans4PASS (accepted to CVPR2022)☆92Updated 2 years ago
- The offical implemention of JM3D.☆30Updated last week
- ☆36Updated 2 years ago
- (AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models☆54Updated last year
- [ICCV 2023] CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training☆120Updated last year
- [ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and…☆31Updated last year
- [CVPR 2022] Multi-View Transformer for 3D Visual Grounding☆77Updated 2 years ago
- [ECCV 2022] Masked Discrimination for Self-Supervised Learning on Point Clouds☆96Updated 3 years ago
- ☆51Updated last year
- ☆21Updated 4 months ago
- ☆43Updated 2 years ago
- Masked Surfel Prediction for Self-Supervised Point Cloud Learning☆27Updated last year
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆31Updated 2 years ago
- [CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds☆56Updated 2 years ago
- ☆73Updated 6 months ago
- (ECCV 2022) DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation☆49Updated 2 years ago
- [CVPR2023] Official implementation of “PiMAE: Point cloud and Image Interactive Masked Autoencoders for 3D Object Detecion”☆135Updated 8 months ago
- [NeurIPS 2022 Spotlight] P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting☆129Updated 2 years ago
- ☆19Updated 11 months ago
- Official implementation for [3DV 2024] `Pix4Point: Image Pretrained Standard Transformers for 3D Point Cloud Understanding`☆47Updated last year
- [CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation☆155Updated 2 years ago
- [CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding☆125Updated last year
- [ICCV 2023]The PyTorch implementation of TL-Align: Token-Label Alignment for Vision Transformers.☆23Updated 2 years ago
- [ICCV 2023] New framework: Domain adaptation using a single prompt. Main contribution: Prompt-driven Instance Normalization (PIN)☆120Updated 5 months ago
- CVPR2022: Large-scale Video Panoptic Segmentation in the Wild: A Benchmark☆144Updated 2 years ago
- (ICCV2023) IST-Net: Prior-free Category-level Pose Estimation with Implicit Space Transformation☆114Updated last year