Cadene / recipe1m.bootstrap.pytorchLinks

Retrieve recipes from foodie pictures using Deep Learning and Pytorch

☆57

Alternatives and similar repositories for recipe1m.bootstrap.pytorch

Users that are interested in recipe1m.bootstrap.pytorch are comparing it to the libraries listed below

Sorting:

Seth-Park / MultimodalExplanations
Code release for Park et al. Multimodal Multimodal Explanations: Justifying Decisions and Pointing to the Evidence. in CVPR, 2018
☆48Updated 6 years ago
yuweijiang / HGL-pytorch
Code for the model "Heterogeneous Graph Learning for Visual Commonsense Reasoning (NeurlPS 2019)"
☆47Updated 4 years ago
hwang1996 / ACME
Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images
☆58Updated 6 years ago
yj-yu / lsmdc
☆32Updated 6 years ago
uwnlp / verb-attributes
Contains code for the EMNLP paper `Learning Linguistic Attributes for Zero-Shot Verb Classification'
☆26Updated 7 years ago
mesnico / TERN
Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144
☆58Updated last year
hassanhub / MultiGrounding
This is the repo for Multi-level textual grounding
☆33Updated 4 years ago
XiaoxiaoGuo / fashion-retrieval
This repository contains an implementation of the models introduced in the paper Dialog-based Interactive Image Retrieval. The network is…
☆69Updated 4 years ago
hardyqr / HAL
[AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".
☆38Updated last year
Maluuba / FigureQA
☆27Updated 5 years ago
airsplay / VisualRelationships
Data of ACL 2019 Paper "Expressing Visual Relationships via Language".
☆62Updated 4 years ago
hardyqr / Visual-Semantic-Embeddings-an-incomplete-list
A paper list of visual semantic embeddings and text-image retrieval.
☆41Updated 4 years ago
mitjanikolaus / compositional-image-captioning
Code for the CoNLL 2019 paper "Compositional Generalization in Image Captioning" by Mitja Nikolaus, Mostafa Abdou, Matthew Lamm, Rahul Ar…
☆26Updated 5 years ago
multimodal / multimodal
A collection of multimodal datasets, and visual features for VQA and captionning in pytorch. Just run "pip install multimodal"
☆82Updated 3 years ago
cdancette / rubi.bootstrap.pytorch
NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering
☆62Updated 4 years ago
ecom-research / ComposeAE
Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
☆57Updated 3 years ago
YuJiang01 / n2nmn_pytorch
implement n2nmn with pytorch
☆19Updated 6 years ago
yanbeic / VAL
Tensorflow Implementation on Paper [CVPR2020]Image Search with Text Feedback by Visiolinguistic Attention Learning
☆63Updated 4 years ago
aylai / DenotationGraph
Generate a denotation graph from a set of image captions
☆15Updated 6 years ago
ronghanghu / gqa_single_hop_baseline
A simple but well-performing "single-hop" visual attention model for the GQA dataset
☆20Updated 5 years ago
AmingWu / Multi-modal-Circulant-Fusion
the source code of Multi-modal Circulant Fusion (MCF) for Temporal Activity Localization
☆23Updated 6 years ago
intersun / LightningDOT
source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT
☆72Updated 2 years ago
antoine77340 / Mixture-of-Embedding-Experts
Mixture-of-Embeddings-Experts
☆120Updated 4 years ago
VisionLearningGroup / MULE
Implementation of "MULE: Multimodal Universal Language Embedding"
☆16Updated 5 years ago
Wangt-CN / MTFN-RR-PyTorch-Code
The offical code for paper "Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking", ACM Multimedia 2019 Oral
☆68Updated 5 years ago
lichengunc / pretrain-vl-data
Pre-trained V+L Data Preparation
☆46Updated 5 years ago
KaihuaTang / VCTree-Visual-Question-Answering
Code for the Visual Question Answering (VQA) part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contex…
☆34Updated 6 years ago
zhegan27 / VILLA
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER…
☆119Updated 4 years ago
Dong-JinKim / DenseRelationalCaptioning
Code of Dense Relational Captioning
☆69Updated 2 years ago
allenai / swig
Situation With Groundings (SWiG) dataset and Joint Situation Localizer (JSL)
☆66Updated 4 years ago