IDEA-Research / DINO-X-API
☆124Updated this week
Related projects ⓘ
Alternatives and complementary repositories for DINO-X-API
- ☆35Updated this week
- ☆62Updated 11 months ago
- CAVIS: Context-Aware Video Instance Segmentation☆60Updated last month
- ☆150Updated 2 months ago
- ☆211Updated 4 months ago
- ☆29Updated last month
- ☆93Updated 4 months ago
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆110Updated 6 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆95Updated 3 months ago
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆90Updated 4 months ago
- The official implementation of "Segment Anything with Multiple Modalities".☆68Updated 2 months ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆40Updated last month
- [ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"☆193Updated this week
- ☆102Updated 5 months ago
- [ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection☆95Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆52Updated 9 months ago
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆235Updated 10 months ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆73Updated 7 months ago
- DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution☆39Updated 2 weeks ago
- 1-shot image segmentation using Stable Diffusion☆129Updated 8 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆112Updated 3 months ago
- Official Code for Tracking Any Object Amodally☆113Updated 4 months ago
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆57Updated 2 months ago
- 【ECCV2024】The official repo of Griffon series☆105Updated 2 weeks ago
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆35Updated 3 weeks ago
- ☆23Updated 3 weeks ago
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…☆82Updated last year
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆52Updated 2 months ago
- ☆84Updated 4 months ago
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆58Updated 3 weeks ago