IDEA-Research / DINO-X-API
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
☆791Updated 2 weeks ago
Alternatives and similar repositories for DINO-X-API:
Users that are interested in DINO-X-API are comparing it to the libraries listed below
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2☆1,518Updated 3 weeks ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,152Updated 2 months ago
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆853Updated 5 months ago
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”☆448Updated last month
- Official repository for "AM-RADIO: Reduce All Domains Into One"☆892Updated this week
- [CVPR 2024] Official implementation of the paper "Visual In-context Learning"☆427Updated 9 months ago
- Run Segment Anything Model 2 on a live video stream☆257Updated last month
- [NeurIPS 2024] Code release for "Segment Anything without Supervision"☆441Updated 3 months ago
- [ECCV 2024] Tokenize Anything via Prompting☆557Updated last month
- This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detectio…☆502Updated 6 months ago
- Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆283Updated this week
- A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT☆696Updated last year
- [ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"☆679Updated 11 months ago
- [ICLR'24] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching☆463Updated last month
- [ICCV 2023] Tracking Anything with Decoupled Video Segmentation☆1,302Updated 5 months ago
- CoRL 2024☆363Updated 2 months ago
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆331Updated this week
- Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024☆1,431Updated 6 months ago
- [CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale☆1,073Updated 2 months ago
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,446Updated 5 months ago
- A curated publication list on open vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation).☆523Updated last month
- ☆218Updated 6 months ago
- Efficient Track Anything☆441Updated last week
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆2,237Updated 3 weeks ago
- Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"☆937Updated 5 months ago
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)☆331Updated 4 months ago
- [AAAI 2025] Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"☆424Updated last week
- Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".☆554Updated last year
- Depth Any Video with Scalable Synthetic Data☆439Updated last month
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆585Updated 5 months ago