xxyzll / UMBLinks
UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)
☆11Updated last year
Alternatives and similar repositories for UMB
Users that are interested in UMB are comparing it to the libraries listed below
Sorting:
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆59Updated 2 months ago
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆100Updated 3 months ago
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)☆26Updated 7 months ago
- Official implementation of ResCLIP: Residual Attention for Training-free Dense Vision-language Inference☆60Updated 3 months ago
- ☆27Updated 2 years ago
- [CVPR 2025] Official PyTorch Code for "DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models"☆40Updated last week
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆29Updated 2 months ago
- ☆23Updated last year
- ☆13Updated last year
- code for FineLIP☆38Updated 2 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆44Updated 7 months ago
- This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models"☆89Updated 8 months ago
- Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)☆29Updated 2 years ago
- [CVPR 2025] Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space☆36Updated 6 months ago
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆109Updated 2 months ago
- Official PyTorch Code for Anchor Token Guided Prompt Learning Methods: [ICCV 2025] ATPrompt and [Arxiv 2511.21188] AnchorOPT☆122Updated last week
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition☆16Updated last year
- (ECCV2024) Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery (TextGCD)☆21Updated 2 months ago
- cliptrase☆47Updated last year
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection☆28Updated last year
- Official implementations of our LaZSL (ICCV'25)☆39Updated 6 months ago
- [ECCV' 24 Oral] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection☆29Updated last year
- The official implementation of paper Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model. If you find our code or paper use…☆50Updated 2 years ago
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆64Updated 3 weeks ago
- GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery (CVPR2025)☆32Updated 10 months ago
- [AAAI 2024] TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training☆106Updated 2 years ago
- KTCN: Enhancing Open-World Object Detection with Knowledge Tansfer and Class-Awareness Neutralization (IJCAI 24)☆11Updated last year
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆33Updated 2 years ago
- CLIP the Gap CVPR 2023☆83Updated 2 years ago
- (CVPR 2024) ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning☆49Updated last year