Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
☆215 · Feb 12, 2025 · Updated last year
Alternatives and similar repositories for coco-cn
Users that are interested in coco-cn are comparing it to the libraries listed below.
- Cross-lingual image captioning · ☆92 · May 9, 2022 · Updated 4 years ago
- Code for fluency-guided cross-lingual image captioning · ☆33 · Apr 13, 2018 · Updated 8 years ago
- A bilingual dataset for image captioning · ☆19 · Oct 28, 2020 · Updated 5 years ago
- Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019 · ☆41 · Sep 24, 2019 · Updated 6 years ago
- Bridging Vision and Language Model · ☆286 · Mar 27, 2023 · Updated 3 years ago
- PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018) · ☆580 · May 18, 2023 · Updated 3 years ago
- Oscar and VinVL · ☆1,053 · Aug 28, 2023 · Updated 2 years ago
- Unpaired Image Captioning · ☆36 · Mar 25, 2021 · Updated 5 years ago
- ☆170 · Nov 9, 2023 · Updated 2 years ago
- ☆22 · Oct 9, 2021 · Updated 4 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019 · ☆50 · Dec 18, 2019 · Updated 6 years ago
- Starter code for the VMT task and challenge · ☆51 · Jul 29, 2020 · Updated 5 years ago
- Code for Unsupervised Image Captioning · ☆223 · Mar 24, 2023 · Updated 3 years ago
- SPADnet: Deep RGB-SPAD Sensor Fusion Assisted by Monocular Depth Estimation · ☆18 · Jul 5, 2020 · Updated 5 years ago
- Grid features pre-training code for visual question answering · ☆269 · Sep 17, 2021 · Updated 4 years ago
- Unofficial PyTorch implementation of Self-critical Sequence Training for Image Captioning, and others · ☆1,003 · Oct 5, 2023 · Updated 2 years ago
- PyTorch library for Visual-Semantic tasks · ☆29 · Nov 16, 2022 · Updated 3 years ago
- PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching" · ☆304 · Jan 14, 2020 · Updated 6 years ago
- Code for paper "Attention on Attention for Image Captioning". ICCV 2019 · ☆339 · May 2, 2021 · Updated 5 years ago
- Project page for VinVL · ☆359 · Jul 26, 2023 · Updated 2 years ago
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning · ☆70 · May 20, 2021 · Updated 5 years ago
- Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019 · ☆282 · Dec 21, 2022 · Updated 3 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020 · ☆82 · Jul 17, 2020 · Updated 5 years ago
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ) · ☆34 · Jul 2, 2020 · Updated 5 years ago
- ☆35 · Mar 22, 2019 · Updated 7 years ago
- ☆66 · Dec 15, 2023 · Updated 2 years ago
- Chinese image captioning + visual attention · ☆193 · Jan 9, 2020 · Updated 6 years ago
- ☆60 · Nov 29, 2016 · Updated 9 years ago
- Code for the AI Challenger contest (generating Chinese image captions) · ☆216 · Oct 19, 2018 · Updated 7 years ago
- Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., … · ☆200 · Dec 1, 2022 · Updated 3 years ago
- Re-implementation of the work Livebot · ☆16 · Jun 21, 2020 · Updated 5 years ago
- ☆218 · Feb 26, 2022 · Updated 4 years ago
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching · ☆170 · Oct 12, 2020 · Updated 5 years ago
- ☆11 · Sep 7, 2020 · Updated 5 years ago
- Language-Agnostic Visual-Semantic Embeddings (ICCV'19) · ☆22 · Nov 11, 2019 · Updated 6 years ago
- METER: A Multimodal End-to-end TransformER Framework · ☆376 · Nov 16, 2022 · Updated 3 years ago
- Bling's Object detection tool · ☆55 · Jan 9, 2023 · Updated 3 years ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral) · ☆165 · Aug 24, 2025 · Updated 9 months ago
- Source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT · ☆72 · Nov 14, 2022 · Updated 3 years ago