ltttpku / CMMP
☆12Updated 2 months ago
Related projects: ⓘ
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆35Updated last month
- The official implementation of ADDP (ICLR 2024)☆11Updated 5 months ago
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Updated last year
- [ECCV2024] Learning Video Context as Interleaved Multimodal Sequences☆17Updated 3 weeks ago
- ☆12Updated 3 weeks ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆35Updated last year
- Official implementation of TagAlign☆31Updated 5 months ago
- REVO-LION: Evaluating and Refining Vision-Language Instruction Tuning Datasets☆11Updated 11 months ago
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆10Updated last month
- ☆19Updated 11 months ago
- [WACV 2024] Instruct Me More! Random Prompting for Visual In-Context Learning☆13Updated 5 months ago
- [ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning☆36Updated last year
- ☆11Updated 7 months ago
- Visual self-questioning for large vision-language assistant.☆22Updated 3 weeks ago
- ☆17Updated last year
- [CVPR 2024] The official pytorch implementation of "A General and Efficient Training for Transformer via Token Expansion".☆36Updated 4 months ago
- ☆15Updated 4 months ago
- ☆20Updated last year
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆15Updated last year
- ☆21Updated last year
- [ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning☆41Updated 4 months ago
- Stay tuned!☆11Updated 5 months ago
- [NeurIPS 2023] LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition☆14Updated 3 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆26Updated 2 months ago
- ☆20Updated 11 months ago
- Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"☆19Updated 4 months ago
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models☆36Updated last year
- [ICML2024]The official implementation of SemiRES in PyTorch.☆18Updated 3 months ago
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆41Updated 3 months ago
- Turning to Video for Transcript Sorting☆44Updated last year