Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
☆212Feb 12, 2025Updated last year
Alternatives and similar repositories for coco-cn
Users who are interested in coco-cn are comparing it to the libraries listed below.
- Cross-lingual image captioning☆91May 9, 2022Updated 3 years ago
- Code for fluency-guided cross-lingual image captioning☆33Apr 13, 2018Updated 7 years ago
- A bilingual dataset for image captioning☆19Oct 28, 2020Updated 5 years ago
- Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019☆41Sep 24, 2019Updated 6 years ago
- Bridging Vision and Language Model☆286Mar 27, 2023Updated 3 years ago
- PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)☆579May 18, 2023Updated 2 years ago
- Oscar and VinVL☆1,052Aug 28, 2023Updated 2 years ago
- Unpaired Image Captioning☆36Mar 25, 2021Updated 5 years ago
- ☆169Nov 9, 2023Updated 2 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆51Dec 18, 2019Updated 6 years ago
- ☆22Oct 9, 2021Updated 4 years ago
- Starter code for the VMT task and challenge☆51Jul 29, 2020Updated 5 years ago
- Code for Unsupervised Image Captioning☆223Mar 24, 2023Updated 3 years ago
- SPADnet: Deep RGB-SPAD Sensor Fusion Assisted by Monocular Depth Estimation☆17Jul 5, 2020Updated 5 years ago
- Grid features pre-training code for visual question answering☆269Sep 17, 2021Updated 4 years ago
- Unofficial PyTorch implementation of Self-critical Sequence Training for Image Captioning, and others.☆1,005Oct 5, 2023Updated 2 years ago
- PyTorch library for Visual-Semantic tasks☆29Nov 16, 2022Updated 3 years ago
- PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"☆302Jan 14, 2020Updated 6 years ago
- Code for paper "Attention on Attention for Image Captioning". ICCV 2019☆339May 2, 2021Updated 4 years ago
- project page for VinVL☆359Jul 26, 2023Updated 2 years ago
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning☆70May 20, 2021Updated 4 years ago
- Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019☆282Dec 21, 2022Updated 3 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆82Jul 17, 2020Updated 5 years ago
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)☆34Jul 2, 2020Updated 5 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆5,841Aug 29, 2025Updated 7 months ago
- ☆35Mar 22, 2019Updated 7 years ago
- ☆65Dec 15, 2023Updated 2 years ago
- Chinese image captioning + visual attention☆193Jan 9, 2020Updated 6 years ago
- Code for the AI Challenger contest (generating Chinese image captions)☆216Oct 19, 2018Updated 7 years ago
- ☆10Jun 28, 2023Updated 2 years ago
- Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., …☆200Dec 1, 2022Updated 3 years ago
- Re-implementation of the work Livebot☆16Jun 21, 2020Updated 5 years ago
- ☆218Feb 26, 2022Updated 4 years ago
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching☆170Oct 12, 2020Updated 5 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- Language-Agnostic Visual-Semantic Embeddings (ICCV'19)☆22Nov 11, 2019Updated 6 years ago
- METER: A Multimodal End-to-end TransformER Framework☆377Nov 16, 2022Updated 3 years ago
- Bling's Object detection tool☆56Jan 9, 2023Updated 3 years ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)☆164Aug 24, 2025Updated 7 months ago