mikkoim / dinotool
Command-line tool for extracting DINO, CLIP, and SigLIP2 features for images and videos
☆32 · Updated last month
Alternatives and similar repositories for dinotool
Users who are interested in dinotool are comparing it to the libraries listed below.
- ☆76 · Updated 2 months ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection ☆58 · Updated last year
- Cache PyTorch module outputs on-the-fly ☆44 · Updated 4 months ago
- Efficient parallelizable algorithms for multidimensional arrays to speed up your data pipelines ☆22 · Updated last month
- EdgeSAM model for use with Autodistill. ☆29 · Updated last year
- [ICCV25] Official Implementation of LeGrad ☆78 · Updated 11 months ago
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios! ☆26 · Updated 8 months ago
- Compare Savant and PyTorch performance ☆13 · Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆36 · Updated last year
- Timm model explorer ☆41 · Updated last year
- Official code repository for ICML 2025 paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Doma… ☆44 · Updated 3 weeks ago
- ☆103 · Updated 5 months ago
- Induce brain-like topographic structure in your neural networks ☆69 · Updated last month
- ☆59 · Updated last year
- ☆44 · Updated 7 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 · Updated last year
- ☆71 · Updated 2 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model" ☆61 · Updated 9 months ago
- This repo is the official implementation of iSeg: An Iterative Refinement-based Framework for Training-free Segmentation. ☆37 · Updated 9 months ago
- [CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT). ☆380 · Updated this week
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, … ☆69 · Updated 4 months ago
- A FiftyOne Plugin that allows you to search across any modality in your videos! ☆21 · Updated 3 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆128 · Updated last year
- PyTorch Implementation of Object Recognition as Next Token Prediction [CVPR'24 Highlight] ☆181 · Updated 4 months ago
- Fine-tuning OpenAI CLIP Model for Image Search on medical images ☆77 · Updated 3 years ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models" ICLR 2024 ☆104 · Updated last year
- A tool for converting computer vision label formats. ☆73 · Updated this week
- Lightweight, open-source, high-performance Yolo implementation ☆43 · Updated 3 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings" ☆125 · Updated last year
- ☆69 · Updated last year