SonicCodes / dinov2-clip
DINOv2 features aligned with CLIP
☆18 · Updated 11 months ago
Alternatives and similar repositories for dinov2-clip
Users who are interested in dinov2-clip are comparing it to the repositories listed below.
- Official code for "DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut", NeurIPS 202… ☆39 · Updated 5 months ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control ☆80 · Updated 7 months ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception ☆62 · Updated 2 weeks ago
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation ☆12 · Updated last year
- [ECCV 2024 Oral] Official implementation of the paper "PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers" ☆16 · Updated 3 months ago
- Open-Vocabulary Panoptic Segmentation ☆24 · Updated 2 weeks ago
- ☆29 · Updated 5 months ago
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement ☆57 · Updated 2 months ago
- ☆26 · Updated 8 months ago
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025] ☆49 · Updated 4 months ago
- [NeurIPS 2024] Understanding Multi-Granularity for Open-Vocabulary Part Segmentation ☆50 · Updated 6 months ago
- ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements [CVPRW 2025] ☆23 · Updated 2 months ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of … ☆68 · Updated last year
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation". ☆24 · Updated 8 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory ☆58 · Updated 4 months ago
- Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers" ☆61 · Updated last week
- ☆32 · Updated last year
- ☆32 · Updated last year
- SINDER: Repairing the Singular Defects of DINOv2 (ECCV 2024 Oral) ☆37 · Updated 9 months ago
- [AAAI 2025] Official pytorch implementation of "Diffusion Model Patching via Mixture-of-Prompts" ☆12 · Updated 6 months ago
- Official Implementation for CVPR 2024 paper: CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor ☆108 · Updated last year
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts" ☆31 · Updated 8 months ago
- Official implementation of "Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive" (ICLR 2024) ☆54 · Updated 9 months ago
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆58 · Updated this week
- Code for "Don’t drop your samples! Coherence-aware training benefits Conditional diffusion" CVPR 2024 Highlight ☆53 · Updated 3 months ago
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paper ☆88 · Updated last month
- Downstream semantic segmentation evaluation of DGInStyle. ☆25 · Updated last year
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation ☆44 · Updated 8 months ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better ☆29 · Updated 2 weeks ago
- [CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval ☆25 · Updated 3 months ago