zhongfansun / HPMLinks
Breaking Boundary Between Pre-training and Fine-tuning with Hybrid Prompting for Knowledge-Based VQA
☆143Updated last year
Alternatives and similar repositories for HPM
Users that are interested in HPM are comparing it to the libraries listed below
Sorting:
- ☆139Updated 3 months ago
- ☆138Updated 4 months ago
- ☆137Updated last year
- ☆141Updated 11 months ago
- ☆141Updated 2 years ago
- This repository contains the reference source code for the paper ["Scalable Modular Network: A Framework for Adaptive Learning via Agreem…☆138Updated last year
- official codes for our WACV 2024 paper (Interpretable Object Recognition by Semantic Prototype Analysis)☆144Updated last year
- This repository is the official implementation of the paper "Understanding Few-Shot Learning: Measuring Task Relatedness and Adaptation D…☆146Updated last year
- Codes for the WACV 2023 paper: "Semantic Guided Latent Parts Embedding for Few-Shot Learning"☆146Updated 2 years ago
- ☆171Updated 4 months ago
- Implement Diffusion Models with PyTorch.☆23Updated 9 months ago
- This repository is intended to store the code and data for ASAP (Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting…☆14Updated 2 months ago
- Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)☆23Updated last year
- Codebase for "Jodi: Unification of Visual Generation and Understanding via Joint Modeling"☆83Updated 2 months ago
- A large-scale dataset composed of high-quality synthetic images aimed at evaluating social biases in LVLMs☆13Updated 3 months ago
- [ECCV'24] T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models☆15Updated 2 months ago
- The datasets for image emotion computing☆36Updated 3 years ago
- Facial Action Unit Detection Model and Visualization Canvas☆26Updated 2 weeks ago
- Pose-disentangled Contrastive Learning☆14Updated last year
- ☆10Updated 11 months ago
- Implementation (R2R part) for the paper "Iterative Vision-and-Language Navigation"☆17Updated last year
- ☆11Updated 5 months ago
- [CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering☆20Updated 3 months ago
- [CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".☆771Updated 2 years ago
- Deepfake + LLM (CVPR 2025 oral)☆59Updated last month
- [CVPR 2024] Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning☆24Updated 5 months ago
- A curated list of recent papers (2023–2025) on controllable generative models, covering diffusion-based architectures with fine-grained c…☆30Updated 2 months ago
- Local self-attention in Transformer for visual question answering☆12Updated last year
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆657Updated 2 weeks ago
- [ECCVW/TWYN 2024 - Best Workshop Paper] Are CLIP features all you need for Universal Synthetic Image Origin Attribution?☆11Updated 7 months ago