KoDohwan / VT-TWINSLinks
Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)
☆12Updated 3 years ago
Alternatives and similar repositories for VT-TWINS
Users that are interested in VT-TWINS are comparing it to the libraries listed below
Sorting:
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆42Updated 3 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆27Updated 3 years ago
- ☆14Updated 3 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 4 years ago