mikkoim / dinotoolLinks
Command-line tool for extracting DINOv3, CLIP, SigLIP2, RADIO, features for images and videos
☆60Updated 2 months ago
Alternatives and similar repositories for dinotool
Users that are interested in dinotool are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution☆211Updated last month
- ☆43Updated 2 months ago
- [NeurIPS 2025 Spotlight] "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."☆170Updated last week
- ☆41Updated 6 months ago
- This is the official code release for [LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors](https://arxiv…☆43Updated last year
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆77Updated 3 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆59Updated 10 months ago
- ☆44Updated 10 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆134Updated last year
- This repository is for the first survey on SAM & SAM2 for Videos.☆52Updated 8 months ago
- Scaling Vision Pre-Training to 4K Resolution☆217Updated 4 months ago
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation☆101Updated last month
- Code of paper "A new baseline for edge detection: Make Encoder-Decoder great again"☆40Updated 6 months ago
- The Missing Point in Vision Transformers for Universal Image Segmentation☆56Updated last month
- [ICML 2025] Official Implementation for SimDINO/SimDINOv2☆182Updated 9 months ago
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆55Updated 10 months ago
- [CVPR'2025] EntitySAM: Segment Everything in Video☆58Updated 5 months ago
- ☆205Updated last week
- ☆19Updated last year
- TIPS (ICLR'25): Text-Image Pretraining with Spatial Awareness☆110Updated 8 months ago
- [NeurIPS 2024] Official code for DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut☆48Updated 11 months ago
- [ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆243Updated last month
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆125Updated last month
- dinov2 features aligned with CLIP☆20Updated last year
- ☆30Updated 3 months ago
- Segment This Thing is an efficient image segmentation models that uses a biologically-inspired foveated tokenization to reduce inference …☆55Updated 6 months ago
- [CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆356Updated 3 months ago
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation☆77Updated 2 months ago
- Official Code for: "DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency"☆30Updated 8 months ago
- This repo is the official implementation of iSeg: An Iterative Refinement-based Framework for Training-free Segmentation.☆39Updated last year