mikkoim / dinotoolLinks

Command-line tool for extracting DINOv3, CLIP, SigLIP2, RADIO, features for images and videos

☆55

Alternatives and similar repositories for dinotool

Users that are interested in dinotool are comparing it to the libraries listed below

Sorting:

PaulCouairon / JAFAR
[NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution
☆204Updated 2 weeks ago
saksham-s / lift
This is the official code release for [LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors](https://arxiv…
☆42Updated last year
Chipmunk-g4 / Template-Matching-and-Regression
☆42Updated last month
aminebdj / 3D-OWIS
[NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …
☆67Updated 2 years ago
QianWangX / VidSeg_diffusion
Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]
☆55Updated 9 months ago
ClaudiaCuttano / SANSA
[NeurIPS 2025 Spotlight] "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."
☆165Updated last week
visinf / cups
Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)
☆74Updated 2 months ago
SimonZeng7108 / efficientsam3
☆134Updated last week
google-deepmind / tips
☆107Updated 7 months ago
karazijal / lrtl
☆44Updated 10 months ago
Amshaker / MAVOS
[WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory
☆59Updated 9 months ago
gmberton / image-retrieval
All You Need to Know About Image Retrieval: a repo to automagically download datasets and run experiments
☆62Updated 8 months ago
cvlab-kaist / locotrack
Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024)
☆201Updated 7 months ago
NVlabs / FeatSharp
☆41Updated 5 months ago
YuHengsss / Trident
[ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation
☆96Updated 2 weeks ago
wimmerth / anyup
Repository of the paper "AnyUp: Universal Feature Upsampling".
☆409Updated last week
NVlabs / PS3
Scaling Vision Pre-Training to 4K Resolution
☆217Updated 3 months ago
autodistill / autodistill-grounded-sam-2
Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.
☆134Updated last year
andrehuang / loftup
[ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"
☆236Updated 3 weeks ago
Sta8is / DINO-Foresight
[NeurIPS 2025] Official Implementation of DINO-Foresight: Looking into the Future with DINO
☆134Updated last week
valeoai / Franca
Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning
☆255Updated 2 months ago
Borda / finetune-RF-DETR
Modular CLI pipeline for fine‑tuning RF‑DETR object detection models on custom datasets.
☆29Updated this week
ymq2017 / entitysam
[CVPR'2025] EntitySAM: Segment Everything in Video
☆54Updated 4 months ago
KevinZ0217 / fast_dinov2
[NeurIPS '25] FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed
☆25Updated 4 months ago
wysoczanska / clip_dinoiser
Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.
☆264Updated last year
pablovela5620 / sam2-depthanything
☆76Updated 7 months ago
aliasgharkhani / SLiMe
1-shot image segmentation using Stable Diffusion
☆142Updated last year
Shengcao-Cao / HASSOD
[NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection
☆58Updated last year
namithap10 / xinc
☆19Updated last year
ClaudiaCuttano / SAMWISE
[CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
☆350Updated 2 months ago