SonicCodes / dinov2-clipLinks
dinov2 features aligned with CLIP
☆20Updated last year
Alternatives and similar repositories for dinov2-clip
Users that are interested in dinov2-clip are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆85Updated last year
- [NeurIPS 2024] Official code for DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut☆48Updated 11 months ago
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement☆76Updated 8 months ago
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation☆101Updated last month
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆70Updated last year
- Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".☆62Updated last year
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆46Updated 11 months ago
- ☆41Updated 6 months ago
- This repo is the official implementation of iSeg: An Iterative Refinement-based Framework for Training-free Segmentation.☆39Updated last year
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆59Updated 10 months ago
- [CVPR'2025] EntitySAM: Segment Everything in Video☆58Updated 5 months ago
- Official implementation of the WACV 2024 paper CLIP-DIY☆34Updated 2 years ago
- Open-Vocabulary Panoptic Segmentation☆27Updated 6 months ago
- [NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution☆211Updated last month
- Official Implementation for CVPR 2024 paper: CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor☆110Updated last year
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆55Updated 10 months ago
- ☆34Updated last month
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paper☆94Updated last month
- The Missing Point in Vision Transformers for Universal Image Segmentation☆56Updated last month
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆28Updated last year
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024☆73Updated last year
- Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning (CVPR 2025)☆32Updated 6 months ago
- ☆71Updated 2 years ago
- This repository is for the first survey on SAM & SAM2 for Videos.☆52Updated 8 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆42Updated 11 months ago
- [NeurIPS '25 Spotlight] Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers"☆156Updated 3 months ago
- Concept Lancet: Image Editing with Compositional Representation Transplant (CVPR 2025)☆19Updated 9 months ago
- ☆34Updated last year
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆147Updated 6 months ago
- [NeurIPS 2025 Spotlight] "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."☆170Updated last week