Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
☆213 · Feb 12, 2025 · Updated last year
Alternatives and similar repositories for coco-cn
Users who are interested in coco-cn are comparing it to the repositories listed below.
- Cross-lingual image captioning ☆92 · May 9, 2022 · Updated 3 years ago
- Code for fluency-guided cross-lingual image captioning ☆33 · Apr 13, 2018 · Updated 8 years ago
- A bilingual dataset for image captioning ☆19 · Oct 28, 2020 · Updated 5 years ago
- Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019 ☆41 · Sep 24, 2019 · Updated 6 years ago
- Bridging Vision and Language Model ☆286 · Mar 27, 2023 · Updated 3 years ago
- PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018) ☆579 · May 18, 2023 · Updated 2 years ago
- Oscar and VinVL ☆1,051 · Aug 28, 2023 · Updated 2 years ago
- Unpaired Image Captioning ☆36 · Mar 25, 2021 · Updated 5 years ago
- ☆169 · Nov 9, 2023 · Updated 2 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019 ☆50 · Dec 18, 2019 · Updated 6 years ago
- ☆22 · Oct 9, 2021 · Updated 4 years ago
- Starter code for the VMT task and challenge ☆51 · Jul 29, 2020 · Updated 5 years ago
- Code for Unsupervised Image Captioning ☆223 · Mar 24, 2023 · Updated 3 years ago
- SPADnet: Deep RGB-SPAD Sensor Fusion Assisted by Monocular Depth Estimation ☆18 · Jul 5, 2020 · Updated 5 years ago
- Grid features pre-training code for visual question answering ☆269 · Sep 17, 2021 · Updated 4 years ago
- Unofficial PyTorch implementation for Self-critical Sequence Training for Image Captioning, and others. ☆1,003 · Oct 5, 2023 · Updated 2 years ago
- PyTorch library for Visual-Semantic tasks ☆29 · Nov 16, 2022 · Updated 3 years ago
- PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching" ☆302 · Jan 14, 2020 · Updated 6 years ago
- Code for paper "Attention on Attention for Image Captioning". ICCV 2019 ☆339 · May 2, 2021 · Updated 4 years ago
- Project page for VinVL ☆359 · Jul 26, 2023 · Updated 2 years ago
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning ☆70 · May 20, 2021 · Updated 4 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020 ☆82 · Jul 17, 2020 · Updated 5 years ago
- Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019 ☆282 · Dec 21, 2022 · Updated 3 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation. ☆5,868 · Mar 31, 2026 · Updated 2 weeks ago
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ) ☆34 · Jul 2, 2020 · Updated 5 years ago
- ☆35 · Mar 22, 2019 · Updated 7 years ago
- ☆65 · Dec 15, 2023 · Updated 2 years ago
- Chinese image captioning + visual attention ☆193 · Jan 9, 2020 · Updated 6 years ago
- ☆60 · Nov 29, 2016 · Updated 9 years ago
- Code for AI Challenger contest. (Generating Chinese image captions) ☆216 · Oct 19, 2018 · Updated 7 years ago
- ☆10 · Jun 28, 2023 · Updated 2 years ago
- Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., … ☆200 · Dec 1, 2022 · Updated 3 years ago
- Re-implementation of the work Livebot ☆16 · Jun 21, 2020 · Updated 5 years ago
- ☆218 · Feb 26, 2022 · Updated 4 years ago
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching ☆170 · Oct 12, 2020 · Updated 5 years ago
- ☆11 · Sep 7, 2020 · Updated 5 years ago
- Language-Agnostic Visual-Semantic Embeddings (ICCV'19) ☆22 · Nov 11, 2019 · Updated 6 years ago
- METER: A Multimodal End-to-end TransformER Framework ☆376 · Nov 16, 2022 · Updated 3 years ago
- Bling's Object detection tool ☆56 · Jan 9, 2023 · Updated 3 years ago