Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
☆248May 26, 2022Updated 3 years ago
Alternatives and similar repositories for SwinBERT
Users that are interested in SwinBERT are comparing it to the libraries listed below
Sorting:
- [CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.☆50Sep 30, 2022Updated 3 years ago
- End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)☆230Jan 3, 2024Updated 2 years ago
- The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".☆18May 10, 2023Updated 2 years ago
- [ICCV 2023] Accurate and Fast Compressed Video Captioning☆52Jul 28, 2025Updated 7 months ago
- An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"☆365Jul 25, 2024Updated last year
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)☆15Jan 2, 2023Updated 3 years ago
- Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"☆54Jul 9, 2021Updated 4 years ago
- ☆26Oct 20, 2021Updated 4 years ago
- This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP☆411Nov 14, 2022Updated 3 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts☆188May 1, 2025Updated 10 months ago
- Video Grounding and Captioning