ruc-aimc-lab / LAFF
Source code of ECCV2022 LAFF for Text-to-Video Retrieval
☆44Updated last year
Alternatives and similar repositories for LAFF:
Users that are interested in LAFF are comparing it to the libraries listed below
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆46Updated 2 years ago
- FutabaSakuraXD / Farewell-to-Mutual-Information-Variational-Distiilation-for-Cross-Modal-Person-Re-identification☆54Updated 3 years ago
- ☆24Updated last year
- Official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval". CVPR 2022☆98Updated 2 years ago
- ☆29Updated 10 months ago
- ☆66Updated last year
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning☆22Updated 2 years ago
- [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”☆30Updated 10 months ago
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆114Updated last year
- Deep Evidential Learning with Noisy Correspondence for Cross-modal Retrieval ( ACM Multimedia 2022, Pytorch Code)☆40Updated 10 months ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆84Updated 3 years ago
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆95Updated 2 years ago
- ☆73Updated last year
- PyTorch Implementation of Deep Equilibrium Multimodal Fusion☆17Updated last year
- Source code of Universal Weighting Metric Learning for Cross-Modal Matching. The paper is accepted by CVPR2020.☆22Updated 2 years ago
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆99Updated last year
- Source code of our CVPR2024 paper TeachCLIP for Text-to-Video Retrieval☆26Updated last month
- (TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information☆26Updated last month
- Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.☆111Updated last year
- [AAAI 2023] Contrastive Masked Autoencoders for Self-Supervised Video Hashing☆26Updated last year
- Code for ECCV 2022 Workshop paper "See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval"☆20Updated last year
- Code of SSAN☆61Updated 11 months ago
- Codes for ECCV2022 paper - contrastive deep supervision☆68Updated 2 years ago
- Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.☆30Updated last year
- The official implementation of 'Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation' (CVPR 2…☆46Updated 2 years ago
- ☆47Updated 2 years ago
- A simple but efficient transformer model for video action recognition☆57Updated 2 years ago
- ☆24Updated 2 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆39Updated 4 months ago
- [CVPR22] Group Contextualization for Video Recognition☆22Updated last year