google/localized-narratives

Localized Narratives

☆86

Alternatives and similar repositories for localized-narratives

Users who are interested in localized-narratives are comparing it to the libraries listed below.

facebookresearch / connect-caption-and-trace
A unified framework to jointly model images, text, and human attention traces.
☆80 · May 24, 2021 · Updated 5 years ago
google-research / trecs_image_generation
☆24 · Feb 24, 2021 · Updated 5 years ago
fenglinliu98 / MIA
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)
☆65 · Oct 19, 2020 · Updated 5 years ago
1Konny / HVP
PyTorch implementation of our paper, "Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction."
☆21 · Feb 10, 2021 · Updated 5 years ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18 · Sep 17, 2021 · Updated 4 years ago
researchmm / soho
[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
☆208 · Sep 30, 2022 · Updated 3 years ago
YuanEZhou / Grounded-Image-Captioning
☆64 · Jan 5, 2022 · Updated 4 years ago
alasdairtran / transform-and-tell
[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning
☆93 · Apr 19, 2024 · Updated 2 years ago
Gitsamshi / WeakVRD-Captioning
Implementation of the paper "Improving Image Captioning with Better Use of Caption"
☆33 · Sep 15, 2020 · Updated 5 years ago
mshukor / eP-ALM
[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.
☆27 · Oct 27, 2023 · Updated 2 years ago
BryanPlummer / flickr30k_entities
Flickr30K Entities Dataset
☆185 · Dec 23, 2018 · Updated 7 years ago
sarthak268 / c3vqg-official
PyTorch implementation for the paper "C3VQG: Category Consistent Cyclic Visual Question Generation" (ACM MM Asia'20).
☆16 · Mar 31, 2023 · Updated 3 years ago
zinengtang / VidLanKD
PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)
☆56 · Feb 6, 2023 · Updated 3 years ago
BCV-Uniandes / PNG
☆61 · Oct 23, 2021 · Updated 4 years ago
shoaibahmed / metadata_archaeology
Official code for the paper "Metadata Archaeology"
☆19 · May 10, 2023 · Updated 3 years ago
locuslab / T-MARS
Code for T-MARS data filtering
☆35 · Aug 23, 2023 · Updated 2 years ago
cshizhe / asg2cap
Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., …
☆200 · Dec 1, 2022 · Updated 3 years ago
BigRedT / info-ground
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
☆73 · Aug 22, 2020 · Updated 5 years ago
facebookresearch / grid-feats-vqa
Grid features pre-training code for visual question answering
☆269 · Sep 17, 2021 · Updated 4 years ago
airsplay / py-bottom-up-attention
PyTorch bottom-up attention with Detectron2
☆239 · Jan 4, 2022 · Updated 4 years ago
gsig / visual-grounding
Project page for "Visual Grounding in Video for Unsupervised Word Translation" (CVPR 2020)
☆43 · Apr 26, 2020 · Updated 6 years ago
microsoft / Oscar
Oscar and VinVL
☆1,054 · Aug 28, 2023 · Updated 2 years ago
evanmiltenburg / MeasureDiversity
Measure the diversity of image descriptions, repository for our COLING 2018 paper.
☆13 · Dec 29, 2019 · Updated 6 years ago
qingzwang / DiversityMetrics
An implementation of self-CIDEr and LSA-based diversity metrics (Python 2.7 only).
☆37 · Feb 26, 2022 · Updated 4 years ago
szzexpoi / AiR
Official repository for the ECCV 2020 paper "AiR: Attention with Reasoning Capability"
☆54 · Jun 29, 2021 · Updated 5 years ago
ylsung / VL_adapter
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR 2022)
☆212 · Dec 18, 2022 · Updated 3 years ago
LuoweiZhou / VLP
Vision-Language Pre-training for Image Captioning and Question Answering
☆420 · Jan 18, 2022 · Updated 4 years ago
helblazer811 / oracle-guided-image-synthesis
This is a repository for my work on the paper "Oracle Guided Image Synthesis with Relative Queries".
☆24 · May 6, 2022 · Updated 4 years ago
jalayrac / object-states-action
Code for the paper "Joint Discovery of Object States and Manipulation Actions" (ICCV 2017)
☆14 · Aug 7, 2018 · Updated 7 years ago
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11 · May 24, 2023 · Updated 3 years ago
kdexd / virtex
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations
☆561 · Aug 22, 2025 · Updated 11 months ago
caiqi / Joint-Contrastive-Learning
☆42 · Oct 9, 2020 · Updated 5 years ago
bearcatt / LaBERT
A length-controllable and non-autoregressive image captioning model.
☆69 · Jun 10, 2021 · Updated 5 years ago
facebookresearch / ActivityNet-Entities
A Dataset for Grounded Video Description
☆165 · Jan 4, 2022 · Updated 4 years ago
MILVLG / mt-captioning
A PyTorch implementation of the paper "Multimodal Transformer with Multiview Visual Representation for Image Captioning"
☆25 · Sep 4, 2020 · Updated 5 years ago
researchmm / generate-it
A collection of models for image<->text generation in ACM MM 2021.
☆67 · Oct 31, 2021 · Updated 4 years ago
google-research-datasets / conceptual-captions
Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image …
☆567 · Aug 21, 2021 · Updated 4 years ago
ruotianluo / GoogleConceptualCaptioning
☆53 · Dec 13, 2019 · Updated 6 years ago
MILVLG / bottom-up-attention.pytorch
A PyTorch reimplementation of bottom-up-attention models
☆302 · Apr 7, 2022 · Updated 4 years ago