Shengcao-Cao / HASSOD
[NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection
☆47Updated 7 months ago
Related projects: ⓘ
- ☆29Updated last week
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆48Updated 2 weeks ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆80Updated last month
- Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆107Updated 3 weeks ago
- Object Recognition as Next Token Prediction (CVPR 2024)☆153Updated last month
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆90Updated 3 months ago
- 1-shot image segmentation using Stable Diffusion☆118Updated 6 months ago
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation☆107Updated last month
- The official implementation of "Segment Anything with Multiple Modalities".☆53Updated 2 weeks ago
- EdgeSAM model for use with Autodistill.☆24Updated 3 months ago
- CAVIS: Context-Aware Video Instance Segmentation☆53Updated 2 months ago
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation☆39Updated 3 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆65Updated 4 months ago
- [CVPR 2023 Highlight] Beyond mAP: Towards better evaluation of instance segmentation☆26Updated last year
- [CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"☆163Updated 4 months ago
- Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …☆34Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆33Updated 11 months ago
- ☆55Updated 3 months ago
- Benchmarking Panoptic Video Scene Graph Generation (PVSG), CVPR'23☆74Updated 4 months ago
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆30Updated 4 months ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆66Updated 9 months ago
- [ECCV'24] Official Implementation of SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance☆93Updated 2 weeks ago
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024☆53Updated 3 months ago
- ☆28Updated 7 months ago
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆65Updated 11 months ago
- [NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"☆260Updated 5 months ago
- Diffusion base mining☆37Updated this week
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆45Updated 2 weeks ago
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.☆168Updated 2 months ago
- Dataset and Code for the paper "AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements"☆29Updated 3 months ago