Shengcao-Cao / HASSOD
[NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection
☆58 Updated last year
Alternatives and similar repositories for HASSOD
Users who are interested in HASSOD are comparing it to the repositories listed below.
- Official code for CAVIS: Context-Aware Video Instance Segmentation ☆91 Updated last month
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25) ☆49 Updated 10 months ago
- Official Code for Tracking Any Object Amodally ☆120 Updated last year
- PyTorch Implementation of Object Recognition as Next Token Prediction [CVPR'24 Highlight] ☆180 Updated 5 months ago
- EdgeSAM model for use with Autodistill. ☆29 Updated last year
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation ☆41 Updated last year
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory ☆58 Updated 8 months ago
- ☆25 Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆37 Updated 2 years ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆132 Updated last year
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark ☆169 Updated 2 weeks ago
- ☆78 Updated 6 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings" ☆125 Updated last year
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation ☆74 Updated last week
- [CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation" ☆331 Updated last month
- ☆36 Updated 2 weeks ago
- 🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024 ☆143 Updated last year
- Code of paper "A new baseline for edge detection: Make Encoder-Decoder great again" ☆40 Updated 4 months ago
- ☆83 Updated 7 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆67 Updated last year
- ☆44 Updated 8 months ago
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations ☆112 Updated 2 weeks ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆101 Updated last year
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi… ☆85 Updated 2 years ago
- Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and … ☆41 Updated 6 months ago
- 1-shot image segmentation using Stable Diffusion ☆141 Updated last year
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces ☆236 Updated 8 months ago
- Timm model explorer ☆42 Updated last year
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆87 Updated 4 months ago
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation ☆128 Updated last year