Official implementation of "Can Language Understand Depth?"
☆83Oct 21, 2022Updated 3 years ago
Alternatives and similar repositories for DepthCLIP
Users that are interested in DepthCLIP are comparing it to the libraries listed below
Sorting:
- ☆17Oct 21, 2021Updated 4 years ago
- Official pytorch implementation of "DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion"☆19Feb 4, 2025Updated last year
- [ECCV2022] RA-Depth: Resolution Adaptive Self-Supervised Monocular Depth Estimation☆74Oct 13, 2022Updated 3 years ago
- [ICCV 2023] PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning☆287Aug 12, 2025Updated 7 months ago
- [AAAI 2023] Zero-Shot Enhancement of CLIP with Parameter-free Attention☆93Apr 29, 2023Updated 2 years ago
- [TCSVT2024] MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model☆32Mar 27, 2025Updated 11 months ago
- This is a repo for CVPR 2022 Paper with Code☆10Apr 13, 2022Updated 3 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- AAAI 2024-Controllable Mind Visual Diffusion Model☆16Dec 18, 2023Updated 2 years ago
- [arVix 2021] "SUB-Depth: Self-distillation and Uncertainty Boosting Self-supervised Monocular Depth Estimation"☆17Apr 19, 2022Updated 3 years ago
- 适合科研人员的Python绘图工具☆15Jun 13, 2024Updated last year
- ☆73Apr 5, 2022Updated 3 years ago
- Python3 / PyTorch implementation of the following paper: Fine-grained Semantics-aware Representation Enhancement for Self-supervisedMonoc…☆95Mar 17, 2023Updated 3 years ago
- [ECCV2022] 3D-PL: Domain Adaptive Depth Estimation with 3D-aware Pseudo-Labeling☆17Sep 20, 2022Updated 3 years ago
- ☆22Apr 26, 2021Updated 4 years ago
- 🚀🚀🚀 [Journal Pre-print] Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review☆15Oct 13, 2025Updated 5 months ago
- (ICCV2023) Official implementation of 'ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance'…☆59Apr 18, 2024Updated last year
- Self-supervised temporally consistent depth estimation☆71Sep 9, 2023Updated 2 years ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆31Mar 5, 2026Updated 2 weeks ago
- [NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation"☆14Dec 31, 2023Updated 2 years ago
- [CVPR2023] This is an official implementation for "PlaneDepth: Self-supervised Depth Estimation via Orthogonal Planes".☆107Jun 6, 2023Updated 2 years ago
- Deep Line Encoding for Monocular 3D Object Detection and Depth Prediction☆16Nov 22, 2021Updated 4 years ago
- A repository of AAAI 2024 paper 'SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model'☆20Jun 27, 2024Updated last year
- [ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer☆438Jul 15, 2025Updated 8 months ago
- [ECCV 2022] Uncertainty Quantification in Depth Estimation via Constrained Ordinal Regression☆13Mar 27, 2023Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".☆12Jan 16, 2023Updated 3 years ago
- 🏠 PyTorch implementation of our ICCV2021 paper: StructDepth: Leveraging the structural regularities for self-supervised indoor depth est…☆143Aug 21, 2021Updated 4 years ago
- Code for generating a single image pretraining dataset☆13Aug 3, 2021Updated 4 years ago
- Code for "Structured Sparsity Inducing Adaptive Optimizers for Deep Learning" in PyTorch☆18Feb 11, 2021Updated 5 years ago
- ☆34Feb 20, 2025Updated last year
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆82Dec 10, 2025Updated 3 months ago
- VF-NeRF code. We learn to densely reconstruct indoor from multi-view images by representing the surface with Vector Fields (VF). We devel…☆20Sep 30, 2024Updated last year
- Official implementation of Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models (ICLR 2024 Spotlight)☆15Dec 27, 2024Updated last year
- Official PyTorch implementation of CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences (CVPR 2024 Po…☆19Apr 29, 2024Updated last year
- [NeurIPS2023] IEBins: Iterative Elastic Bins for Monocular Depth Estimation☆91Dec 30, 2024Updated last year
- ☆209Feb 24, 2024Updated 2 years ago
- [ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model"☆10Sep 28, 2024Updated last year
- [CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation☆72Jul 25, 2023Updated 2 years ago