Trunpm / TPT-for-VideoQA
☆19 Updated 2 years ago
Alternatives and similar repositories for TPT-for-VideoQA
Users interested in TPT-for-VideoQA are comparing it to the repositories listed below
- ☆9 Updated 2 years ago
- Lightweight Transformer for Multi-modal Tasks ☆16 Updated 2 years ago
- SMCA replication ☆21 Updated 3 years ago
- Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features ☆12 Updated 4 years ago
- Code for cross-modal image retrieval for SYSU-MM01 ☆16 Updated 5 years ago
- Awesome video-based self-supervised learning methods in recent years ☆9 Updated 4 years ago
- PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021). ☆47 Updated 3 years ago
- Image captioning paper list ☆8 Updated 5 years ago
- ☆35 Updated last year
- 🏆 The 2nd Place Submission to the CVPR2021-Evoked Emotion from Videos challenge. ☆17 Updated 4 years ago
- ☆11 Updated 4 years ago
- ☆24 Updated 3 years ago
- Published in CVPR 2020; MATLAB code ☆22 Updated 10 months ago
- Gender/age attribute grounding in a weakly supervised manner. ☆12 Updated 6 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms ☆20 Updated 3 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions ☆10 Updated 4 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval ☆26 Updated 4 years ago
- PIC Challenge Baseline ☆19 Updated 6 years ago
- ☆29 Updated last year
- The PyTorch implementation for "Video-Text Pre-training with Learned Regions" ☆42 Updated 3 years ago
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs". ☆38 Updated last year
- Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding ☆4 Updated 4 years ago
- ☆72 Updated last year
- ☆20 Updated 2 years ago
- ☆26 Updated 4 years ago
- A Simple Framework for CV Pre-training Models (SOCO, VirTex, BEiT) ☆15 Updated 3 years ago
- Code for Temporal Data Augmentations (ECCVW 2020) ☆37 Updated 4 years ago
- ☆16 Updated 4 years ago
- Shared Attention for Multi-label Zero-shot Learning accepted @ CVPR20 ☆32 Updated 3 years ago
- [ICCV 2021] Official PyTorch implementation for Deep Relational Metric Learning. ☆43 Updated 3 years ago