li-xirong / coco-cnView external linksLinks
Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
☆211Feb 12, 2025Updated last year
Alternatives and similar repositories for coco-cn
Users that are interested in coco-cn are comparing it to the libraries listed below
Sorting:
- Cross-lingual image captioning☆91May 9, 2022Updated 3 years ago
- code for fluency-guided cross-lingual image captioning☆33Apr 13, 2018Updated 7 years ago
- A bilingual dataset for image captioning☆19Oct 28, 2020Updated 5 years ago
- Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019☆41Sep 24, 2019Updated 6 years ago
- PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)☆579May 18, 2023Updated 2 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆51Dec 18, 2019Updated 6 years ago
- Bridging Vision and Language Model☆286Mar 27, 2023Updated 2 years ago
- Code for Unsupervised Image Captioning☆221Mar 24, 2023Updated 2 years ago
- Oscar and VinVL☆1,052Aug 28, 2023Updated 2 years ago
- ☆168Nov 9, 2023Updated 2 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- PyTorch library for Visual-Semantic tasks☆29Nov 16, 2022Updated 3 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆81Jul 17, 2020Updated 5 years ago
- Grid features pre-training code for visual question answering☆269Sep 17, 2021Updated 4 years ago
- Unpaired Image Captioning☆36Mar 25, 2021Updated 4 years ago
- ☆22Oct 9, 2021Updated 4 years ago
- Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019☆282Dec 21, 2022Updated 3 years ago
- Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.☆1,007Oct 5, 2023Updated 2 years ago
- project page for VinVL☆359Jul 26, 2023Updated 2 years ago
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)☆34Jul 2, 2020Updated 5 years ago
- PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"☆302Jan 14, 2020Updated 6 years ago
- Starter code for the VMT task and challenge☆51Jul 29, 2020Updated 5 years ago
- Code for paper "Attention on Attention for Image Captioning". ICCV 2019☆339May 2, 2021Updated 4 years ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)☆164Aug 24, 2025Updated 5 months ago
- ☆218Feb 26, 2022Updated 3 years ago
- ☆35Mar 22, 2019Updated 6 years ago
- Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)☆135Mar 15, 2024Updated last year
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆5,795Aug 29, 2025Updated 5 months ago
- Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., …☆200Dec 1, 2022Updated 3 years ago
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆42Jul 15, 2022Updated 3 years ago
- ☆65Dec 15, 2023Updated 2 years ago
- Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".☆211Jun 12, 2020Updated 5 years ago
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching☆170Oct 12, 2020Updated 5 years ago
- Contrastive Learning for Image Captioning☆51Feb 22, 2018Updated 7 years ago
- Position Focused Attention Network for Image-Text Matching☆69Aug 20, 2019Updated 6 years ago
- ☆1,217May 13, 2024Updated last year
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆723Aug 8, 2023Updated 2 years ago
- 图像中文描述+视觉注意力☆192Jan 9, 2020Updated 6 years ago
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning☆70May 20, 2021Updated 4 years ago