xiaojieli0903 / FGKVMemPred_video
Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)
☆21 Updated 4 months ago
Related projects
Alternatives and complementary repositories for FGKVMemPred_video
- Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023) ☆24 Updated 4 months ago
- [ECCV 2024] Official repository of "GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning". ☆23 Updated 2 weeks ago
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning (NeurIPS 2024) ☆35 Updated 3 weeks ago
- Official repository of "Mamba-FSCIL: Dynamic Adaptation with Selective State Space Model for Few-Shot Class-Incremental Learning" ☆21 Updated 3 months ago
- This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with H… ☆17 Updated 6 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em… ☆44 Updated last month
- [NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations ☆121 Updated 7 months ago
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding" ☆69 Updated 6 months ago
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation ☆29 Updated last month
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023] ☆109 Updated last year
- Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23 ☆173 Updated 11 months ago
- This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin… ☆48 Updated 3 months ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs) ☆56 Updated this week
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning ☆12 Updated 4 months ago
- Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal … ☆27 Updated this week
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass ☆171 Updated last year
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text … ☆12 Updated 2 months ago
- [CVPR 2023] Diversity-Aware Meta Visual Prompting ☆78 Updated 11 months ago
- Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models" ☆16 Updated 7 months ago
- Awesome Vision-Language Pretraining Papers ☆29 Updated 2 months ago
- HallE-Control: Controlling Object Hallucination in LMMs ☆28 Updated 7 months ago
- Code for the paper "A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval" accepted b… ☆18 Updated 10 months ago
- [CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning" ☆75 Updated this week
- [NeurIPS'22] This is an official implementation for "Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning". ☆173 Updated last year
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding ☆144 Updated 4 months ago
- [CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension ☆34 Updated 7 months ago