Official implementation of "Can Language Understand Depth?"
☆83Oct 21, 2022Updated 3 years ago
Alternatives and similar repositories for DepthCLIP
Users that are interested in DepthCLIP are comparing it to the libraries listed below
Sorting:
- Official implementation of "WorDepth: Variational Language Prior for Monocular Depth Estimation"☆46Feb 4, 2025Updated last year
- ☆23Aug 28, 2023Updated 2 years ago
- ☆17Oct 21, 2021Updated 4 years ago
- [ECCV2022] RA-Depth: Resolution Adaptive Self-Supervised Monocular Depth Estimation☆74Oct 13, 2022Updated 3 years ago
- ☆22Apr 26, 2021Updated 4 years ago
- [AAAI 2023] Zero-Shot Enhancement of CLIP with Parameter-free Attention☆93Apr 29, 2023Updated 2 years ago
- A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".☆12Jan 16, 2023Updated 3 years ago
- ☆22Nov 18, 2025Updated 3 months ago
- This is a repo for CVPR 2022 Paper with Code☆10Apr 13, 2022Updated 3 years ago
- [ICCV 2023] PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning☆284Aug 12, 2025Updated 6 months ago
- Reimplementation of ECCV paper "NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis" with PyTorch Library.☆37Apr 7, 2022Updated 3 years ago
- [AAAI 2025] Official Implementation of I-HallA v1.0☆13Feb 2, 2025Updated last year
- [ISBI 2024] Semi-supervised Medical Image Segmentation Method Based on Cross-pseudo Labeling Leveraging Strong and Weak Data Augmentation…☆16Feb 23, 2025Updated last year
- Code for generating a single image pretraining dataset☆13Aug 3, 2021Updated 4 years ago
- 适合科研人员的Python绘图工具☆14Jun 13, 2024Updated last year
- [NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation"☆14Dec 31, 2023Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆14Sep 30, 2023Updated 2 years ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆28Nov 4, 2025Updated 3 months ago
- Official implementation of the NRNS paper☆36Jun 13, 2022Updated 3 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- [TCSVT2024] MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model☆32Mar 27, 2025Updated 11 months ago
- Unsupervised single image depth prediction with CNNs☆14Jun 25, 2021Updated 4 years ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆544Sep 15, 2023Updated 2 years ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆91Jul 4, 2024Updated last year
- Gradient-based Uncertainty for Monocular Depth Estimation (ECCV 2022)☆54Sep 7, 2023Updated 2 years ago
- Official code release of "CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition"☆242May 1, 2023Updated 2 years ago
- [MICCAI 2024] PEPSI: Pathology-Enhanced Pulse-Sequence-Invariant Representations for Brain MRI☆18Jan 31, 2026Updated last month
- AAAI 2024-Controllable Mind Visual Diffusion Model☆16Dec 18, 2023Updated 2 years ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation☆22Nov 17, 2025Updated 3 months ago
- [CVPR 23] Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!☆17May 14, 2024Updated last year
- 🚀🚀🚀 [Journal Pre-print] Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review☆15Oct 13, 2025Updated 4 months ago
- Official implementation for ECCV 2022 paper "Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular Depth"☆130Feb 28, 2023Updated 3 years ago
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval☆13Sep 18, 2025Updated 5 months ago
- ☆34Feb 20, 2025Updated last year
- Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining, WACV 2024☆14Jan 3, 2024Updated 2 years ago
- (ICCV2023) Official implementation of 'ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance'…☆59Apr 18, 2024Updated last year
- [ECCV2022] 3D-PL: Domain Adaptive Depth Estimation with 3D-aware Pseudo-Labeling☆17Sep 20, 2022Updated 3 years ago
- [ECCV 2022] Uncertainty Quantification in Depth Estimation via Constrained Ordinal Regression☆13Mar 27, 2023Updated 2 years ago
- Official Implementation of ACMMM'21 paper "Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd Counting"☆16May 17, 2022Updated 3 years ago