autodistill / autodistill-dinov2Links
DINOv2 module for use with Autodistill.
☆14Updated last year
Alternatives and similar repositories for autodistill-dinov2
Users that are interested in autodistill-dinov2 are comparing it to the libraries listed below
Sorting:
- Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"☆102Updated 2 months ago
- ☆132Updated 4 months ago
- 🏄 [ICLR 2025] OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer☆71Updated 3 weeks ago
- Official code for NetTrack [CVPR 2024]☆99Updated last year
- Combining "segment-anything" with MOT, it create the era of "MOTS"☆156Updated 2 years ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆83Updated last year
- DVIS: Decoupled Video Instance Segmentation Framework☆152Updated last year
- AutoTrackAnything is a universal, flexible and interactive tool for insane automatic object tracking over thousands of frames. It is deve…☆85Updated last year
- OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]☆107Updated 10 months ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆259Updated 4 months ago
- [ICCV 2023] The official PyTorch code for Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation☆89Updated last year
- YOLO-World + EfficientViT SAM☆103Updated last year
- yolov8 model with SAM meta☆140Updated last year
- Video Mask Transfiner for High-Quality Video Instance Segmentation (ECCV'2022)☆30Updated 2 years ago
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆91Updated 5 months ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆151Updated 4 months ago
- Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos.☆66Updated last month
- Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning☆112Updated last month
- ☆125Updated last year
- SSA + FastSAM/Semantic Fast Segment Anything , or Fast Semantic Segment Anything☆103Updated 2 months ago
- [ICCV 2023] ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking☆160Updated last year
- ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground Selection (CVPR2023)☆52Updated last year
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…☆385Updated this week
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆369Updated 2 months ago
- Codebase for the Recognize Anything Model (RAM)☆82Updated last year
- CAVIS: Context-Aware Video Instance Segmentation☆89Updated 3 weeks ago
- Official Code for "Lifting Multi-View Detection and Tracking to the Bird’s Eye View"☆43Updated last year
- Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers"☆58Updated 5 months ago
- Recognize Any Regions☆122Updated 8 months ago
- An official implementation of "Hulk: A Universal Knowledge Translator for Human-Centric Tasks"☆136Updated 8 months ago