MightXiong / FedMITLinks
☆11Updated 2 months ago
Alternatives and similar repositories for FedMIT
Users that are interested in FedMIT are comparing it to the libraries listed below
Sorting:
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation☆35Updated 7 months ago
- Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024☆36Updated 5 months ago
- ☆34Updated last year
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆72Updated 11 months ago
- [ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization☆41Updated last year
- ☆30Updated last year
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Updated 2 years ago
- Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos☆24Updated 11 months ago
- ☆39Updated last year
- [CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception☆35Updated last year
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization☆14Updated last year
- Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos☆43Updated last year
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆33Updated 2 months ago
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆123Updated 4 months ago
- [NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations☆134Updated last year
- ☆21Updated 2 years ago
- ☆96Updated last year
- A lightweight codebase for referring expression comprehension and segmentation☆55Updated 3 years ago
- A reading list of papers about Visual Grounding.☆31Updated 2 years ago
- This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with H…☆17Updated last year
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'☆18Updated last year
- [ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension☆15Updated 2 years ago
- [CVPR 2023] Cascade Evidential Learning for Open-world Weakly-supervised Temporal Action Localization☆11Updated 10 months ago
- [TPAMI 2024] This is the Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding".☆17Updated 3 weeks ago
- ☆92Updated last year
- Awesome Vision-Language Pretraining Papers☆30Updated 4 months ago
- An official implementation for MS-DETR in ACL'23☆17Updated 2 years ago
- ☆31Updated 3 years ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆68Updated 7 months ago
- [NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding☆50Updated last year