IDEA-Research / Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
☆1,918Updated last week
Alternatives and similar repositories for Grounded-SAM-2:
Users that are interested in Grounded-SAM-2 are comparing it to the libraries listed below
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,557Updated 8 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆968Updated last week
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆921Updated 2 months ago
- SAM with text prompt☆2,083Updated last month
- Official Implementation of CVPR24 highligt paper: Matching Anything by Segmenting Anything☆1,246Updated 4 months ago
- [ICCV 2023] Tracking Anything with Decoupled Video Segmentation☆1,355Updated 8 months ago
- Official repository for "AM-RADIO: Reduce All Domains Into One"☆955Updated last week
- [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"☆7,765Updated 7 months ago
- Efficient vision foundation models for high-resolution generation and perception.☆2,766Updated this week
- [CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation☆835Updated 4 months ago
- Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scena…☆821Updated last year
- Segment Anything in High Quality [NeurIPS 2023]☆3,855Updated 3 months ago
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆2,303Updated 3 months ago
- An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary alg…☆2,915Updated 11 months ago
- Grounded Language-Image Pre-training☆2,363Updated last year
- This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detectio…☆570Updated 9 months ago
- [CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segme…☆1,292Updated last year
- [NeurIPS 2024] Code release for "Segment Anything without Supervision"☆456Updated 5 months ago
- Run Segment Anything Model 2 on a live video stream☆339Updated 2 months ago
- Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024☆1,478Updated 9 months ago
- Tracking and collecting papers/projects/others related to Segment Anything.☆1,601Updated 3 weeks ago
- [CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation☆1,579Updated 6 months ago
- YOLOE: Real-Time Seeing Anything☆1,022Updated this week
- Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts☆1,155Updated 3 months ago
- [ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"☆701Updated last year
- Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).☆2,221Updated last year
- This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).☆929Updated this week
- [NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"☆4,549Updated 7 months ago
- EVA Series: Visual Representation Fantasies from BAAI☆2,459Updated 8 months ago
- The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foun…☆1,702Updated 3 weeks ago