Harryqu123 / LMC
[NeurIPS 2023] LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition
☆17Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for LMC
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆52Updated 3 months ago
- ☆16Updated last month
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆27Updated 2 years ago
- ☆57Updated last year
- VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation☆20Updated 2 months ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆41Updated last year
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆64Updated last month
- The offical implemention of JM3D.☆28Updated last year
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆24Updated 9 months ago
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆56Updated 3 months ago
- ☆22Updated last year
- Turning to Video for Transcript Sorting☆46Updated last year
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆45Updated 4 months ago
- Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)☆54Updated 7 months ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆32Updated last week
- Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision☆24Updated last month
- FreeVA: Offline MLLM as Training-Free Video Assistant☆49Updated 5 months ago
- [ICCV 2023 Oral] Official repository for “On the Robustness of Open-World Test-Time Training: Self-Training with Dynamic Prototype Expans…☆41Updated last year
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆78Updated 8 months ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Updated last year
- ☆22Updated last month
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆55Updated last year
- OVAD: Open-vocabulary Attribute Detection code☆28Updated last year
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models☆36Updated last year
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22☆42Updated 2 years ago
- ☆21Updated last year
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Updated last year
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆32Updated 5 months ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆27Updated 7 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆61Updated 7 months ago