xiaojieli0903 / genview
[ECCV 2024] Official repository of "GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning".
☆23Updated 2 months ago
Related projects: ⓘ
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models☆30Updated 2 months ago
- Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)☆24Updated 2 months ago
- Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)☆21Updated 2 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆62Updated 4 months ago
- ☆21Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆35Updated 11 months ago
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation☆27Updated 3 weeks ago
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆63Updated last month
- Official repository of ”Mamba-FSCIL: Dynamic Adaptation with Selective State Space Model for Few-Shot Class-Incremental Learning"☆18Updated 3 weeks ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆27Updated last month
- Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning☆20Updated 9 months ago
- A much powerful probing method to tune your model with promising performance and linear probing training cost!☆15Updated last year
- [CVPR 2023] Diversity-Aware Meta Visual Prompting☆73Updated 9 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆62Updated 7 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆54Updated 2 months ago
- ☆16Updated last year
- cliptrase☆15Updated 2 weeks ago
- Code for paper "AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention"☆13Updated 2 months ago
- The official code of paper "Automated Multi-level Preference for MLLMs"☆15Updated 3 weeks ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆65Updated last year
- This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with H…☆17Updated 4 months ago
- [ICCV 2023 oral] This is the official repository for our paper: ''Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning''.☆60Updated 11 months ago
- ☆15Updated last year
- Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.☆10Updated 2 years ago
- ☆45Updated last year
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆41Updated 3 months ago
- ☆30Updated 9 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆37Updated last month
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆53Updated 10 months ago
- ☆32Updated 10 months ago