Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
☆214 · Feb 12, 2025 · Updated last year
Alternatives and similar repositories for coco-cn
Users who are interested in coco-cn are comparing it to the libraries listed below.
- Cross-lingual image captioning ☆92 · May 9, 2022 · Updated 4 years ago
- code for fluency-guided cross-lingual image captioning ☆33 · Apr 13, 2018 · Updated 8 years ago
- A bilingual dataset for image captioning ☆19 · Oct 28, 2020 · Updated 5 years ago
- Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019 ☆41 · Sep 24, 2019 · Updated 6 years ago
- Bridging Vision and Language Model ☆286 · Mar 27, 2023 · Updated 3 years ago
- PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018) ☆580 · May 18, 2023 · Updated 2 years ago
- Oscar and VinVL ☆1,053 · Aug 28, 2023 · Updated 2 years ago
- Unpaired Image Captioning ☆36 · Mar 25, 2021 · Updated 5 years ago
- ☆170 · Nov 9, 2023 · Updated 2 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019 ☆50 · Dec 18, 2019 · Updated 6 years ago
- ☆22 · Oct 9, 2021 · Updated 4 years ago
- Starter code for the VMT task and challenge ☆51 · Jul 29, 2020 · Updated 5 years ago
- Code for Unsupervised Image Captioning ☆223 · Mar 24, 2023 · Updated 3 years ago
- SPADnet: Deep RGB-SPAD Sensor Fusion Assisted by Monocular Depth Estimation ☆18 · Jul 5, 2020 · Updated 5 years ago
- Grid features pre-training code for visual question answering ☆269 · Sep 17, 2021 · Updated 4 years ago
- Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others. ☆1,003 · Oct 5, 2023 · Updated 2 years ago
- PyTorch library for Visual-Semantic tasks ☆29 · Nov 16, 2022 · Updated 3 years ago
- PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching" ☆304 · Jan 14, 2020 · Updated 6 years ago
- Code for paper "Attention on Attention for Image Captioning". ICCV 2019 ☆339 · May 2, 2021 · Updated 5 years ago
- project page for VinVL ☆359 · Jul 26, 2023 · Updated 2 years ago
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning ☆70 · May 20, 2021 · Updated 4 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020 ☆82 · Jul 17, 2020 · Updated 5 years ago
- Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019 ☆282 · Dec 21, 2022 · Updated 3 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation. ☆5,887 · Mar 31, 2026 · Updated last month
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ) ☆34 · Jul 2, 2020 · Updated 5 years ago
- ☆35 · Mar 22, 2019 · Updated 7 years ago
- ☆66 · Dec 15, 2023 · Updated 2 years ago
- Chinese image captioning + visual attention ☆193 · Jan 9, 2020 · Updated 6 years ago
- ☆60 · Nov 29, 2016 · Updated 9 years ago
- Code for AI Challenger contest. (Generating Chinese image captions) ☆216 · Oct 19, 2018 · Updated 7 years ago
- Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., … ☆200 · Dec 1, 2022 · Updated 3 years ago
- Re-implementation of the work Livebot ☆16 · Jun 21, 2020 · Updated 5 years ago
- ☆218 · Feb 26, 2022 · Updated 4 years ago
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching ☆170 · Oct 12, 2020 · Updated 5 years ago
- ☆11 · Sep 7, 2020 · Updated 5 years ago
- Language-Agnostic Visual-Semantic Embeddings (ICCV'19) ☆22 · Nov 11, 2019 · Updated 6 years ago
- METER: A Multimodal End-to-end TransformER Framework ☆376 · Nov 16, 2022 · Updated 3 years ago
- Bling's Object detection tool ☆56 · Jan 9, 2023 · Updated 3 years ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral) ☆165 · Aug 24, 2025 · Updated 8 months ago