D-Robotics-AI-Lab / DOSOD
A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space
☆51Updated this week
Alternatives and similar repositories for DOSOD:
Users that are interested in DOSOD are comparing it to the libraries listed below
- Official code for NetTrack [CVPR 2024]☆86Updated 10 months ago
- ☆24Updated 8 months ago
- [CVPR2024] BEVSee☆61Updated 6 months ago
- Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆87Updated last month
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆194Updated 3 weeks ago
- NIDS-Net: A unified framework for novel instance detection and segmentation☆47Updated 4 months ago
- ☆218Updated 6 months ago
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆64Updated 2 months ago
- ROS package for SOTA Computer Vision Models including SAM, Cutie, GroundingDINO, YOLO-World, VLPart, DEVA and MaskDINO.☆41Updated 5 months ago
- YOLO-World + EfficientViT SAM☆88Updated 11 months ago
- Open-Vocabulary Panoptic Segmentation☆20Updated 4 months ago
- ☆150Updated 6 months ago
- ☆14Updated 4 months ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆81Updated last month
- YOLO-UniOW: Efficient Universal Open-World Object Detection☆62Updated this week
- Fine tuning grounding Dino☆82Updated 3 weeks ago
- try to export sam2 to onnx.☆30Updated 3 months ago
- Tensorrt codebase to inference in c++ for all major neural arch using onnx☆25Updated 4 months ago
- ☆62Updated last year
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆73Updated 2 months ago
- ☆111Updated last year
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆37Updated last week
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆49Updated 2 months ago
- Official Implementation of ECCV2024 paper: SLAck☆26Updated 4 months ago
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes☆77Updated 3 months ago
- One summary of efficient segment anything models☆84Updated 5 months ago
- ☆20Updated this week
- Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica dataset…☆81Updated 4 months ago
- DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution☆39Updated 2 months ago
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆78Updated 3 weeks ago