lorebianchi98/Talk2DINO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lorebianchi98/Talk2DINO)

lorebianchi98 / Talk2DINO

[ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"

☆193

Alternatives and similar repositories for Talk2DINO

Users that are interested in Talk2DINO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aimagelab / DICE
View on GitHub
[ICCV 2025] What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
☆15Nov 3, 2025Updated 8 months ago
Ruggero1912 / Patch-ioner
View on GitHub
[CVPR 2026] Official Repository of the Paper "One Patch to Caption Them All A Unified Zero-Shot Captioning Framework"
☆15Jun 4, 2026Updated last month
YuHengsss / Trident
View on GitHub
[ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation
☆125Nov 22, 2025Updated 8 months ago
aimagelab / VHS
View on GitHub
[CVPR2026 Findings] VHS: Verifier on Hidden States, an efficient inference-time scaling verification framework for DiT-based image genera…
☆16Mar 25, 2026Updated 3 months ago
lorebianchi98 / CountingDINO
View on GitHub
[WACV 2026] Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”
☆63Jun 22, 2026Updated 3 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zdk258 / CorrCLIP
View on GitHub
[ICCV 2025 Oral] CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation
☆70Aug 1, 2025Updated 11 months ago
mc-lan / ProxyCLIP
View on GitHub
[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
☆120Mar 26, 2025Updated last year
ciampluca / PrACo
View on GitHub
☆16May 19, 2026Updated 2 months ago
lorebianchi98 / FG-CLIP
View on GitHub
[CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".
☆31May 12, 2025Updated last year
TilemahosAravanis / Retrieve-and-Segment
View on GitHub
[CVPR 2026 - Highlight] Official Implementation of "Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open…
☆25Jul 10, 2026Updated last week
MICV-yonsei / CASS
View on GitHub
[CVPR 2025] Official Pytorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
☆50Mar 27, 2025Updated last year
lorebianchi98 / FG-OVD
View on GitHub
[CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec…
☆68Apr 4, 2025Updated last year
RADSeg-OVSS / RADSeg
View on GitHub
[CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…
☆60May 31, 2026Updated last month
phuselab / tppgaze
View on GitHub
☆17Feb 20, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
chenqi1126 / FreeCP
View on GitHub
[ICCV 2025] Training-Free Class Purification for Open-Vocabulary Semantic Segmentation
☆16Nov 2, 2025Updated 8 months ago
aimagelab / DiCO
View on GitHub
[BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization
☆20Sep 11, 2024Updated last year
vladan-stojnic / LPOSS
View on GitHub
Code for LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation (CVPR2025)
☆24Nov 8, 2025Updated 8 months ago
letitiabanana / PnP-OVSS
View on GitHub
[CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
☆18Jul 22, 2024Updated 2 years ago
aimagelab / ReflectiVA
View on GitHub
[CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
☆56Jul 14, 2025Updated last year
aimagelab / ReT-2
View on GitHub
Recurrence Meets Transformers for Universal Multimodal Retrieval
☆15Dec 15, 2025Updated 7 months ago
likyoo / SimFeatUp
View on GitHub
☆26Apr 27, 2025Updated last year
PGSmall / PEARL
View on GitHub
Official code for CVPR2026 "PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation"
☆20Mar 24, 2026Updated 3 months ago
aimagelab / freeda
View on GitHub
FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)
☆50Aug 28, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hustvl / MaskAdapter
View on GitHub
[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"
☆135Oct 23, 2025Updated 8 months ago
aimagelab / pacscore
View on GitHub
[CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation
☆66Jul 29, 2025Updated 11 months ago
rpng / online_lang_splatting
View on GitHub
[ICCV 2025] Official Implementation of "Online Language Splatting"
☆65Jan 6, 2026Updated 6 months ago
valeoai / Franca
View on GitHub
Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning
☆278Sep 24, 2025Updated 9 months ago
SJTU-DeepVisionLab / HyperCLIP
View on GitHub
[CVPR 2025] Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space
☆41Jul 18, 2025Updated last year
likyoo / SegEarth-OV
View on GitHub
[CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images
☆272Jul 9, 2025Updated last year
wimmerth / anyup
View on GitHub
[ICLR '26 Oral] Official repository of the paper "AnyUp: Universal Feature Upsampling".
☆569Apr 17, 2026Updated 3 months ago
naomikombol / SPAR
View on GitHub
SPAR: Single-Pass Any-Resolution ViT for Open-vocabulary Segmentation
☆26Jun 29, 2026Updated 3 weeks ago
kaist-cvml / part-catseg
View on GitHub
[CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation
☆30Nov 17, 2025Updated 8 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
enricollen / fastRTC-voice-agent
View on GitHub
Realtime voice-enabled AI assistant that can engage in natural conversations
☆31Nov 23, 2025Updated 7 months ago
codiceSpaghetti / numpyGPT
View on GitHub
A from-scratch GPT built with NumPy and Python’s standard library. No autograd, no frameworks: every layer is re-implemented with its own…
☆20Nov 23, 2025Updated 7 months ago
aimagelab / ScanDiff
View on GitHub
This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …
☆27May 13, 2026Updated 2 months ago
aimagelab / MaPeT
View on GitHub
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
☆16Jul 1, 2025Updated last year
Becomebright / MTV
View on GitHub
Revisiting Multi-Task Visual Representation Learning
☆22Jan 21, 2026Updated 6 months ago
naver / panst3r
View on GitHub
PanSt3R: Multi-view Consistent Panoptic Segmentation (official code)
☆78Mar 20, 2026Updated 4 months ago
MarkYu98 / madpose
View on GitHub
[CVPR 2025 Highlight] Official implementation of the solvers and estimators proposed in the paper "Relative Pose Estimation through Affin…
☆237Apr 8, 2025Updated last year