MightXiong / FedMITLinks
☆11Updated 6 months ago
Alternatives and similar repositories for FedMIT
Users that are interested in FedMIT are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations☆140Updated last year
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation☆36Updated 11 months ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆79Updated last year
- Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024☆37Updated 9 months ago
- ☆101Updated last year
- [TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…☆19Updated 4 months ago
- Awesome Vision-Language Pretraining Papers☆34Updated 8 months ago
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning☆167Updated 2 years ago
- Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)☆23Updated last year
- This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin…☆78Updated last year
- [NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding☆52Updated last year
- ☆36Updated last year
- Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)☆27Updated last year
- ☆13Updated 2 years ago
- A lightweight codebase for referring expression comprehension and segmentation☆55Updated 3 years ago
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆276Updated last year
- A comprehensive survey of Composed Multi-modal Retrieval (CMR), including Composed Image Retrieval (CIR) and Composed Video Retrieval (CV…☆55Updated last month
- ☆21Updated last year
- Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos☆42Updated last year
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆28Updated last week
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆171Updated last year
- ☆94Updated 2 years ago
- Instruction Tuning in Continual Learning paradigm☆59Updated 7 months ago
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆130Updated last month
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption☆98Updated 2 years ago
- ☆21Updated 2 years ago
- Collection of Composed Image Retrieval (CIR) papers.☆265Updated last month
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass☆193Updated 2 years ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆104Updated 2 years ago
- ☆172Updated last year