autodistill / autodistill-grounded-sam-2
Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.
☆121Updated 9 months ago
Alternatives and similar repositories for autodistill-grounded-sam-2
Users that are interested in autodistill-grounded-sam-2 are comparing it to the libraries listed below
Sorting:
- Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆124Updated last month
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆63Updated 9 months ago
- Official Code for Tracking Any Object Amodally☆118Updated 10 months ago
- EdgeSAM model for use with Autodistill.☆26Updated 11 months ago
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3☆263Updated 5 months ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆289Updated last week
- Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"☆67Updated this week
- ☆40Updated 3 months ago
- Scaling Vision Pre-Training to 4K Resolution☆157Updated 2 weeks ago
- ☆67Updated last month
- Codebase for the Recognize Anything Model (RAM)☆78Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆56Updated last year
- Grounded Tracking for Streaming Videos☆103Updated 7 months ago
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆405Updated last month
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆240Updated 3 months ago
- SSA + FastSAM/Semantic Fast Segment Anything , or Fast Semantic Segment Anything☆99Updated last year
- YOLO-World + EfficientViT SAM☆98Updated last year
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)☆208Updated last month
- ☆71Updated last month
- ☆71Updated last month
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆46Updated 8 months ago
- This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point …☆156Updated 2 years ago
- Muggled SAM: Segmentation without the magic☆133Updated last month
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere☆113Updated 10 months ago
- Real-time pose estimation pipeline with 🤗 Transformers☆59Updated 3 months ago
- ☆267Updated last month
- [CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆175Updated last month
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25)☆37Updated 5 months ago
- ☆189Updated 3 months ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆246Updated last month