sovit-123 / dinov3_stackLinks
A repository to apply DINOv3 models for different downstream tasks: image classification, semantic segmentation, object detection.
☆93Updated 3 months ago
Alternatives and similar repositories for dinov3_stack
Users that are interested in dinov3_stack are comparing it to the libraries listed below
Sorting:
- SegDINO: An Efficient Design for Medical and Natural Image Segmentation with DINO-V3☆229Updated last month
- Official implementation of the WACV 2025 ( Oral ) paper. RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positiv…☆316Updated 10 months ago
- Official code for "No time to train! Training-Free Reference-Based Instance Segmentation"☆275Updated last week
- Testing adaptation of the DINOv2/3 encoders for vision tasks with Low-Rank Adaptation (LoRA)☆426Updated 3 months ago
- [CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).☆526Updated 3 months ago
- Code for PID: Physics-Informed Diffusion Model for Infrared Image Generation☆152Updated 4 months ago
- Official Implementation of Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling☆194Updated 2 months ago
- The repository provides code for training/fine tune the Meta Segment Anything Model 2 (SAM 2)☆290Updated last year
- Official implementation of RT-DETRv4: Painlessly Furthering Real-Time Object Detection with Vision Foundation Models☆280Updated 3 weeks ago
- ☆193Updated 8 months ago
- [TITS 2024] You Only Look Clusters for Tiny Object Detection in Aerial Images☆114Updated last year
- ☆75Updated 2 weeks ago
- [DEIMv2] Real Time Object Detection Meets DINOv3☆1,463Updated last month
- [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆225Updated 3 months ago
- LSNet: See Large, Focus Small [CVPR 2025]☆480Updated 10 months ago
- ☆102Updated 9 months ago
- 🚀🚀🚀Official code for the paper "YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection." *(YO…☆340Updated this week
- The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".☆213Updated last year
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆339Updated last year
- DINOv3训练示例☆138Updated 2 months ago
- [NeurIPS 2024 🔥] DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model☆45Updated last year
- Valeo Anomaly Dataset (VAD)☆33Updated 5 months ago
- InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition (NeurIPS 2025)☆107Updated last week
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆95Updated 10 months ago
- The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".☆117Updated last year
- DinoV2 Backbone for YOLO 🚀☆61Updated last year
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆454Updated 3 months ago
- [CVPR 2024 Workshops] SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusi…☆71Updated last year
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…☆553Updated last month
- One summary of efficient segment anything models☆118Updated last year