BryanPlummer/flickr30k_entities

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/BryanPlummer/flickr30k_entities)

BryanPlummer / flickr30k_entities

Flickr30K Entities Dataset

☆185

Alternatives and similar repositories for flickr30k_entities

Users that are interested in flickr30k_entities are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lichengunc / refer
View on GitHub
Referring Expression Datasets API
☆573Aug 27, 2024Updated last year
necla-ml / SNLI-VE
View on GitHub
Dataset and starting code for visual entailment dataset
☆123Apr 21, 2022Updated 4 years ago
zyang-ur / ReSC
View on GitHub
Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020
☆90Sep 30, 2021Updated 4 years ago
zyang-ur / onestage_grounding
View on GitHub
A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)
☆150Nov 18, 2020Updated 5 years ago
djiajunustc / TransVG
View on GitHub
☆198Feb 27, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TheShadow29 / awesome-grounding
View on GitHub
awesome grounding: A curated list of research papers in visual grounding
☆1,126Sep 21, 2025Updated 10 months ago
youngfly11 / LCMCG-PyTorch
View on GitHub
AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"
☆58Oct 25, 2021Updated 4 years ago
maximek3 / e-ViL
View on GitHub
☆41Nov 23, 2022Updated 3 years ago
BigRedT / info-ground
View on GitHub
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
☆73Aug 22, 2020Updated 5 years ago
virginie-do / e-SNLI-VE
View on GitHub
e-SNLI-VE: Corrected Visual-Textual Entailment with Natural Language Explanations
☆14Aug 19, 2021Updated 4 years ago
google / localized-narratives
View on GitHub
Localized Narratives
☆86Sep 9, 2021Updated 4 years ago
ChopinSharp / ref-nms
View on GitHub
Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"
☆22Dec 20, 2020Updated 5 years ago
zfchenUnique / Cops-Ref
View on GitHub
Accepted by CVPR 2020.
☆27Jul 11, 2024Updated 2 years ago
nithintata / image-caption-generator-using-deep-learning
View on GitHub
Automatically generates captions for an image using Image processing and NLP. Model was trained on Flickr30K dataset.
☆11Jun 11, 2020Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
lichengunc / refer-parser2
View on GitHub
Referring Expression Parser
☆27Feb 10, 2018Updated 8 years ago
yangli18 / VLTVG
View on GitHub
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆97Dec 2, 2022Updated 3 years ago
svip-lab / LBYLNet
View on GitHub
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
☆50Aug 31, 2021Updated 4 years ago
qinzzz / Multimodal-Alignment-Framework
View on GitHub
Implementation for MAF: Multimodal Alignment Framework
☆45Nov 25, 2020Updated 5 years ago
VinitSR7 / Image-Caption-Generation
View on GitHub
Image Captioning: Implementing the Neural Image Caption Generator
☆21Oct 14, 2020Updated 5 years ago
ashkamath / mdetr
View on GitHub
☆1,050Oct 3, 2022Updated 3 years ago
allenai / visual-reasoning-rationalization
View on GitHub
Code associated with the "Natural Language Rationales with Full-Stack Visual Reasoning" EMNLP Findings 2020 paper
☆24Jan 15, 2021Updated 5 years ago
jhuang81 / weak-sup-visual-grounding
View on GitHub
The official implementation of CVPR 2021 Paper: Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation.
☆12Oct 15, 2021Updated 4 years ago
BryanPlummer / cite
View on GitHub
Implementation for our paper "Conditional Image-Text Embedding Networks"
☆39Mar 19, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
LeapLabTHU / Pseudo-Q
View on GitHub
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
☆153Jul 13, 2024Updated 2 years ago
liujch1998 / SoftLabelCCRF
View on GitHub
Implementation of Soft-Label Chain Conditional Random Field for Phrase Grounding in PyTorch
☆16Oct 21, 2022Updated 3 years ago
daqingliu / NMTree
View on GitHub
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
☆38Nov 23, 2019Updated 6 years ago
microsoft / Oscar
View on GitHub
Oscar and VinVL
☆1,054Aug 28, 2023Updated 2 years ago
salesforce / ALBEF
View on GitHub
Code for ALBEF: a new vision-language pre-training method
☆1,755Sep 20, 2022Updated 3 years ago
insomnia94 / DTWREG
View on GitHub
Preliminary code for reviewers
☆12Mar 30, 2021Updated 5 years ago
microsoft / GLIP
View on GitHub
Grounded Language-Image Pre-training
☆2,605Jan 24, 2024Updated 2 years ago
ajd12342 / why-winoground-hard
View on GitHub
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
☆31May 29, 2023Updated 3 years ago
yuhangzang / OV-DETR
View on GitHub
[Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)
☆240Aug 3, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
hassanhub / MultiGrounding
View on GitHub
This is the repo for Multi-level textual grounding
☆34Jul 21, 2020Updated 6 years ago
zdou0830 / METER
View on GitHub
METER: A Multimodal End-to-end TransformER Framework
☆377Nov 16, 2022Updated 3 years ago
nlab-mpg / Flickr30kEnt-JP
View on GitHub
☆13Aug 13, 2021Updated 4 years ago
mlfoundations / VisIT-Bench
View on GitHub
☆51Oct 29, 2023Updated 2 years ago
ajaysub110 / A-Neural-Compositional-Paradigm-for-Image-Captioning
View on GitHub
Implementation of 'A Neural Compositional Paradigm for Image Captioning' by B. Dai, S.Fidler, D. Lin
☆12Mar 15, 2019Updated 7 years ago
facebookresearch / ActivityNet-Entities
View on GitHub
A Dataset for Grounded Video Description
☆165Jan 4, 2022Updated 4 years ago
ChenRocks / UNITER
View on GitHub
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
☆799Jun 30, 2021Updated 5 years ago