[ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"
☆183 · Nov 10, 2025 · Updated 4 months ago
Alternatives and similar repositories for Talk2DINO
Users who are interested in Talk2DINO are comparing it to the libraries listed below.
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation ☆114 · Mar 26, 2025 · Updated last year
- [CVPR 2025] Official Pytorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation ☆50 · Mar 27, 2025 · Updated last year
- [CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec… ☆67 · Apr 4, 2025 · Updated last year
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models ☆18 · Jul 22, 2024 · Updated last year
- [CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?". ☆32 · May 12, 2025 · Updated 10 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation" ☆128 · Oct 23, 2025 · Updated 5 months ago
- ☆17 · Feb 20, 2025 · Updated last year
- Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) … ☆142 · Mar 25, 2026 · Updated 2 weeks ago
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024) ☆49 · Aug 28, 2024 · Updated last year
- Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation" ☆368 · Apr 11, 2024 · Updated last year
- [ICLR 2026] This is the official implementation of PG-Occ: Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocab… ☆33 · Feb 19, 2026 · Updated last month
- [CVPR 2025 Highlight] SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning ☆62 · Jun 26, 2025 · Updated 9 months ago
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation ☆67 · Jul 29, 2025 · Updated 8 months ago
- ☆23 · Oct 17, 2025 · Updated 5 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆23 · Nov 8, 2023 · Updated 2 years ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module) ☆139 · Apr 10, 2025 · Updated 11 months ago
- [ICCV 2025] Official implementation of the paper: "Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Obj… ☆76 · Jul 29, 2025 · Updated 8 months ago
- A list of papers about point cloud based place recognition, also known as loop closure detection in SLAM (processing) ☆10 · Jan 30, 2024 · Updated 2 years ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning ☆315 · Dec 21, 2025 · Updated 3 months ago
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models. ☆15 · May 26, 2025 · Updated 10 months ago
- Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" arXiv, 2023 ☆14 · Apr 1, 2024 · Updated 2 years ago
- cliptrase ☆48 · Sep 1, 2024 · Updated last year
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples. ☆26 · Jun 20, 2023 · Updated 2 years ago
- ☆27 · Mar 7, 2025 · Updated last year
- [ICCV 2025] Language Driven Occupancy Prediction ☆39 · Dec 23, 2024 · Updated last year
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for… ☆32 · Feb 22, 2025 · Updated last year
- Official code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection" ☆14 · Jul 4, 2024 · Updated last year
- Extended Implementation of FastLGS ☆16 · Dec 17, 2024 · Updated last year
- Learning to Count without Annotations ☆23 · May 24, 2024 · Updated last year
- The code implementation for the paper "DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation". ☆28 · Sep 1, 2025 · Updated 7 months ago
- ☆16 · Jun 30, 2025 · Updated 9 months ago
- ☆66 · Sep 8, 2025 · Updated 7 months ago
- [CVPR 2025] This repository is the official implementation of "ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Langua… ☆21 · Apr 1, 2025 · Updated last year
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting ☆228 · Jul 25, 2025 · Updated 8 months ago
- ☆26 · Apr 27, 2025 · Updated 11 months ago
- [CVPR 2025] Towards Training-free Anomaly Detection with Vision and Language Foundation Models ☆90 · May 19, 2025 · Updated 10 months ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 · May 24, 2023 · Updated 2 years ago
- [CVPR2025] Exploring CLIP’s Dense Knowledge for Weakly Supervised Semantic Segmentation ☆68 · Jun 21, 2025 · Updated 9 months ago
- This is the implementation of RT-GS2 ☆26 · Oct 16, 2024 · Updated last year