mikkoim / dinotool
Command-line tool for extracting DINO, CLIP, and SigLIP2 features for images and videos
☆28 · Updated last month
Alternatives and similar repositories for dinotool
Users who are interested in dinotool are comparing it to the libraries listed below.
- ☆102 · Updated 3 months ago
- ☆59 · Updated last year
- Timm model explorer ☆41 · Updated last year
- Official code repository for ICML 2025 paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Doma… ☆38 · Updated 3 weeks ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆36 · Updated last year
- EdgeSAM model for use with Autodistill. ☆27 · Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model" ☆60 · Updated 8 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 · Updated last year
- ☆76 · Updated last month
- ☆68 · Updated 2 weeks ago
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios! ☆25 · Updated 6 months ago
- Compare Savant and PyTorch performance ☆13 · Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection ☆57 · Updated last year
- Official repository for the paper "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation." ☆105 · Updated 3 weeks ago
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling" ☆113 · Updated 3 months ago
- ☆78 · Updated 9 months ago
- ☆42 · Updated 6 months ago
- GroundedSAM Base Model plugin for Autodistill ☆51 · Updated last year
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, … ☆70 · Updated 3 months ago
- Video-LlaVA fine-tune for CinePile evaluation ☆51 · Updated last year
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models". ☆102 · Updated last year
- Efficient parallelizable algorithms for multidimensional arrays to speed up your data pipelines ☆22 · Updated last week
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆101 · Updated 10 months ago
- A FiftyOne Plugin that allows you to search across any modality in your videos! ☆21 · Updated 2 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n… ☆43 · Updated 9 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024 ☆60 · Updated 5 months ago
- [CVPR'24 Highlight] PyTorch Implementation of Object Recognition as Next Token Prediction ☆180 · Updated 3 months ago
- auto_labeler - An all-in-one library to automatically label vision data ☆16 · Updated 6 months ago
- ☆24 · Updated 9 months ago
- Supercharge Your PyTorch Image Models: Bag of Tricks to 8x Faster Inference with ONNX Runtime & Optimizations ☆23 · Updated 10 months ago