hustvl / WeakCLIPLinks
[IJCV 2024]
☆17Updated 11 months ago
Alternatives and similar repositories for WeakCLIP
Users that are interested in WeakCLIP are comparing it to the libraries listed below
Sorting:
- LENS: Learning to Segment Anything with Unified Reinforced Reasoning☆53Updated this week
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆68Updated 4 months ago
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection☆12Updated last year
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆32Updated 4 months ago
- Segment Anything with Deictic Prompting☆27Updated 5 months ago
- [AAAI 2025] Official implementation of the paper "EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation"☆32Updated 10 months ago
- Official implementation of "Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence"☆48Updated this week
- Open-Vocabulary Panoptic Segmentation☆27Updated 4 months ago
- [ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition☆57Updated 6 months ago
- (NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models"☆32Updated 7 months ago
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation"☆38Updated 3 weeks ago
- ☆16Updated last year
- [ICCV 2023] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment☆43Updated 2 years ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆38Updated 4 months ago
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆47Updated 9 months ago
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Updated last year
- ☆33Updated 3 weeks ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆69Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆94Updated 7 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆42Updated 7 months ago
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024☆35Updated last year
- [ICCV2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary…☆118Updated last week
- ☆13Updated 10 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Updated last year
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- ☆51Updated 5 months ago
- ☆30Updated last year
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)☆24Updated 4 months ago
- [AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Refer…☆43Updated last year
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆52Updated 7 months ago