Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.
☆136Aug 7, 2024Updated last year
Alternatives and similar repositories for autodistill-grounded-sam-2
Users that are interested in autodistill-grounded-sam-2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A deep learning-powered visual navigation engine to enables autonomous navigation of pocket-size quadrotor - running on PULP☆13Oct 30, 2024Updated last year
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2☆3,509Nov 11, 2025Updated 6 months ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- GroundedSAM Base Model plugin for Autodistill☆56Apr 17, 2024Updated 2 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Oct 18, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆37Sep 25, 2025Updated 7 months ago
- [Poster; ICLR 2026] [Oral; Neurips OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆16Apr 15, 2026Updated last month
- Unofficial implementation of Semantic-aware Guidance (S-CFG) for ComfyUI☆13Aug 8, 2024Updated last year
- ☆22Oct 25, 2024Updated last year
- EdgeYOLO + ROS 2 object detection package☆29Mar 28, 2023Updated 3 years ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆62Apr 7, 2024Updated 2 years ago
- ☆11Jul 29, 2024Updated last year
- Images to inference with no labeling (use foundation models to train supervised models).☆2,701May 14, 2025Updated last year
- Chat with an AI simulation of anyone as easily as copy-pasting text into a folder!☆19Mar 4, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25)☆55Jan 19, 2026Updated 4 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆19,212Apr 7, 2026Updated last month
- ☆31Dec 20, 2022Updated 3 years ago
- The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading t…☆28Jan 20, 2026Updated 4 months ago
- ☆12Jan 25, 2023Updated 3 years ago
- Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization, ECCV 2024☆15Nov 20, 2024Updated last year
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆1,112Jan 21, 2025Updated last year
- Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes…☆10Feb 20, 2025Updated last year
- The sparse Bayesian learning sandbox☆11Jul 4, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Jan 18, 2024Updated 2 years ago
- Segment-Anything-2 (SAM 2) fine tune with COCO data☆15Aug 20, 2024Updated last year
- Run Segment Anything Model 2 on a live video stream☆585Jun 3, 2025Updated 11 months ago
- ODLabel is a powerful tool for zero-shot object detection, labeling and visualization. It provides an intuitive graphical user interface …☆11May 19, 2024Updated 2 years ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆58Feb 8, 2024Updated 2 years ago
- The ESMStereo models are designed with low computational complexity to achieve an acceptable balance between accuracy and speed, which ma…☆60Aug 31, 2025Updated 8 months ago
- [Arxiv'25] SaLon3R: Structure-aware Long-term Generalizable 3D Reconstruction from Unposed Images☆48Oct 18, 2025Updated 7 months ago
- segment anything model (SAM) infer by ncnn on Android mobile phone☆30Oct 7, 2023Updated 2 years ago
- A collection of ROS 2 packages for autonomous systems, supporting self-driving cars, mobile robots, quadcopters, and other robotic platfo…☆31Jan 17, 2026Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ACE-SLAM: Scene Coordinate Regression for Real-Time SLAM☆93Dec 17, 2025Updated 5 months ago
- Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"☆215Sep 7, 2023Updated 2 years ago
- SAM and lama inpaint,包含QT的GUI交互界面,实现了交互式可实时显示结果的画点、画框进行SAM,然后通过进行Inpaint,具体操作看readme里的视频。☆54Jan 30, 2024Updated 2 years ago
- ☆23Mar 31, 2025Updated last year
- a plugin-oriented framework for video structured. 国产程序员请加微信zhzhi78拉群交流。☆18May 28, 2024Updated last year
- Codes of paper "GraspSAM: When Segment Anything Model meets Grasp Detection", ICRA 2025☆49Feb 17, 2025Updated last year
- Official and maintained implementation of the paper "OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D Medical Data" …☆26Apr 17, 2023Updated 3 years ago