zchoi / GLSCL
Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning"
☆10Updated last week
Related projects ⓘ
Alternatives and complementary repositories for GLSCL
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)☆29Updated 7 months ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆58Updated 4 months ago
- Official Implementation of "The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval"☆46Updated last week
- ☆20Updated 2 months ago
- [CVPR 2024] Official repository of the paper "Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Vid…☆33Updated last week
- [CVPR 2024] Context-Guided Spatio-Temporal Video Grounding☆40Updated 4 months ago
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆37Updated 2 months ago
- Official implementation of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"☆56Updated 3 months ago
- ☆12Updated 10 months ago
- Composed Video Retrieval☆45Updated 6 months ago
- Source code of our CVPR2024 paper TeachCLIP for Text-to-Video Retrieval☆17Updated last week
- Official implementation of "Harnessing Large Language Models for Training-free Video Anomaly Detection", CVPR 2024☆56Updated 3 months ago
- Official project page of the paper "Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges" (Accep…☆26Updated 6 months ago
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning☆12Updated 3 months ago
- ☆11Updated 5 months ago
- Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"☆80Updated 3 months ago
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation☆28Updated 7 months ago
- Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval☆16Updated 3 months ago
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'☆18Updated 8 months ago
- VadCLIP official Pytorch implementation☆108Updated 8 months ago
- [CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".☆224Updated 4 months ago
- [Preprint] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding☆65Updated last month
- (CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization☆17Updated 5 months ago
- ☆13Updated last month
- Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)☆61Updated 5 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆66Updated 9 months ago
- Official Implementation of SnAG (CVPR 2024)☆35Updated 2 weeks ago
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024☆42Updated 2 months ago
- ☆35Updated 7 months ago