Harryqu123 / LMC
[NeurIPS 2023] LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition
☆17 Updated 7 months ago
Alternatives and similar repositories for LMC:
Users that are interested in LMC are comparing it to the libraries listed below
- ☆18 Updated 2 months ago
- The official implementation of JM3D. ☆28 Updated last year
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models ☆60 Updated 5 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation ☆46 Updated 6 months ago
- [ECCV 2024 Best Paper Candidate] Implementation of "Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Vi… ☆48 Updated 3 months ago
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights ☆22 Updated 2 months ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation ☆27 Updated 11 months ago
- ☆58 Updated last year
- ☆16 Updated last year
- ☆22 Updated last year
- [TPAMI reviewing] Towards Visual Grounding: A Survey ☆42 Updated this week
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization ☆54 Updated last year
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners ☆41 Updated last year
- ☆27 Updated 3 months ago
- [ICCV 2023] HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation ☆33 Updated 11 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory" ☆66 Updated 3 months ago
- ☆21 Updated last year
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM). ☆27 Updated this week
- VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation ☆22 Updated 3 months ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023 ☆48 Updated 11 months ago
- ☆34 Updated last year
- Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision ☆27 Updated 2 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning ☆19 Updated 4 months ago
- cliptrase ☆28 Updated 4 months ago
- OVSegmentor, CVPR23 ☆57 Updated 8 months ago
- ☆12 Updated 2 months ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23] ☆96 Updated last year
- MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation ☆25 Updated last year
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning ☆27 Updated 2 years ago
- [ICCV 2023 Oral] Official repository for “On the Robustness of Open-World Test-Time Training: Self-Training with Dynamic Prototype Expans… ☆45 Updated last month