[ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"
☆184 · Nov 10, 2025 · Updated 5 months ago
Alternatives and similar repositories for Talk2DINO
Users that are interested in Talk2DINO are comparing it to the libraries listed below.
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation ☆114 · Mar 26, 2025 · Updated last year
- [CVPR 2025] Official PyTorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation ☆49 · Mar 27, 2025 · Updated last year
- [CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec… ☆67 · Apr 4, 2025 · Updated last year
- [CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?". ☆32 · May 12, 2025 · Updated 11 months ago
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆118 · Nov 22, 2025 · Updated 5 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation" ☆130 · Oct 23, 2025 · Updated 6 months ago
- ☆17 · Feb 20, 2025 · Updated last year
- Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) … ☆150 · Mar 25, 2026 · Updated last month
- Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation" ☆375 · Apr 11, 2024 · Updated 2 years ago
- [ICLR 2026] This is the official implementation of PG-Occ: Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocab… ☆32 · Feb 19, 2026 · Updated 2 months ago
- [CVPR 2025 Highlight] SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning ☆66 · Jun 26, 2025 · Updated 10 months ago
- ☆23 · Oct 17, 2025 · Updated 6 months ago
- A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources. ☆861 · Apr 5, 2026 · Updated 3 weeks ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆23 · Nov 8, 2023 · Updated 2 years ago
- [ICCV 2025] Official implementation of the paper: "Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Obj… ☆77 · Jul 29, 2025 · Updated 9 months ago
- A list of papers about point cloud based place recognition, also known as loop closure detection in SLAM (processing) ☆10 · Jan 30, 2024 · Updated 2 years ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning ☆317 · Dec 21, 2025 · Updated 4 months ago
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models. ☆15 · May 26, 2025 · Updated 11 months ago
- Code for the paper "Automating MedSAM by Learning Prompts with Weak Few-Shot Supervision", published at MICCAI-MedAGI workshop (2024) ☆28 · Nov 25, 2024 · Updated last year
- Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" arXiv, 2023 ☆14 · Apr 1, 2024 · Updated 2 years ago
- cliptrase ☆48 · Sep 1, 2024 · Updated last year
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples. ☆26 · Jun 20, 2023 · Updated 2 years ago
- [ICCV 2025] Language Driven Occupancy Prediction ☆39 · Dec 23, 2024 · Updated last year
- ☆27 · Mar 7, 2025 · Updated last year
- Official code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection" ☆14 · Jul 4, 2024 · Updated last year
- Extended Implementation of FastLGS ☆16 · Dec 17, 2024 · Updated last year
- Learning to Count without Annotations ☆23 · May 24, 2024 · Updated last year
- Official PyTorch Implementation of "Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching" ☆31 · Mar 1, 2026 · Updated last month
- ☆16 · Jun 30, 2025 · Updated 10 months ago
- ☆67 · Sep 8, 2025 · Updated 7 months ago
- [CVPR 2025] This repository is the official implementation of "ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Langua… ☆21 · Apr 1, 2025 · Updated last year
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting ☆229 · Jul 25, 2025 · Updated 9 months ago
- ☆26 · Apr 27, 2025 · Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 · May 24, 2023 · Updated 2 years ago
- [ICCV2025] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectors ☆26 · Jul 10, 2025 · Updated 9 months ago
- This is the implementation of RT-GS2 ☆26 · Oct 16, 2024 · Updated last year
- ☆39 · Jul 12, 2024 · Updated last year
- Showing how to use CLIP-Vip to do video search ☆16 · Nov 16, 2023 · Updated 2 years ago
- [CVPR2025] Exploring CLIP’s Dense Knowledge for Weakly Supervised Semantic Segmentation ☆69 · Jun 21, 2025 · Updated 10 months ago