[ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"
☆180Nov 10, 2025Updated 4 months ago
Alternatives and similar repositories for Talk2DINO
Users who are interested in Talk2DINO are comparing it to the libraries listed below.
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆18Jul 22, 2024Updated last year
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation☆115Nov 22, 2025Updated 3 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆125Oct 23, 2025Updated 4 months ago
- [WACV 2026] Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”☆55Mar 8, 2026Updated last week
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)☆49Aug 28, 2024Updated last year
- Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"☆365Apr 11, 2024Updated last year
- [CVPR 2025 Highlight] SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning☆62Jun 26, 2025Updated 8 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆22Nov 8, 2023Updated 2 years ago
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆65Jul 29, 2025Updated 7 months ago
- ☆23Oct 17, 2025Updated 5 months ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆138Apr 10, 2025Updated 11 months ago
- [ICCV 2025] Official implementation of the paper: "Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Obj…☆74Jul 29, 2025Updated 7 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆311Dec 21, 2025Updated 2 months ago
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.☆14May 26, 2025Updated 9 months ago
- Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" arXiv, 2023☆14Apr 1, 2024Updated last year
- cliptrase☆47Sep 1, 2024Updated last year
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Jun 20, 2023Updated 2 years ago
- ☆27Mar 7, 2025Updated last year
- [ICCV 2025] Language Driven Occupancy Prediction☆38Dec 23, 2024Updated last year
- Official code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Jul 4, 2024Updated last year
- Extended Implementation of FastLGS☆16Dec 17, 2024Updated last year
- Learning to Count without Annotations☆23May 24, 2024Updated last year
- ☆16Jun 30, 2025Updated 8 months ago
- The code implementation for the paper "DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation".☆28Sep 1, 2025Updated 6 months ago
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆66Dec 22, 2025Updated 2 months ago
- ☆66Sep 8, 2025Updated 6 months ago
- [CVPR 2025] This repository is the official implementation of "ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Langua…☆21Apr 1, 2025Updated 11 months ago
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting☆229Jul 25, 2025Updated 7 months ago
- A curated list of publications and resources on open-vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation).☆843Jan 20, 2026Updated 2 months ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- This is the implementation of RT-GS2☆26Oct 16, 2024Updated last year
- [CVPR2025] Exploring CLIP’s Dense Knowledge for Weakly Supervised Semantic Segmentation☆69Jun 21, 2025Updated 8 months ago
- [ICCV2025] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectors☆24Jul 10, 2025Updated 8 months ago
- ☆39Jul 12, 2024Updated last year
- showing how to use CLIP-Vip to do video search☆16Nov 16, 2023Updated 2 years ago
- [ICLR 2025] Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors☆60Mar 4, 2025Updated last year
- Undistorted Depth Support for ScanNet++☆17Dec 8, 2023Updated 2 years ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆98Mar 26, 2025Updated 11 months ago
- Independent PyTorch Implementation of Object Scene Representation Transformer☆49May 25, 2023Updated 2 years ago