torralba-lab / im2recipe-Pytorch
im2recipe Pytorch implementation
☆275Updated 10 months ago
Alternatives and similar repositories for im2recipe-Pytorch:
Users that are interested in im2recipe-Pytorch are comparing it to the libraries listed below
- Code supporting the CVPR 2017 paper "Learning Cross-modal Embeddings for Cooking Recipes and Food Images"☆382Updated 2 months ago
- Retrieve recipes from foodie pictures using Deep Learning and Pytorch☆55Updated 3 years ago
- Vision-Language Pre-training for Image Captioning and Question Answering☆417Updated 3 years ago
- Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image …☆524Updated 3 years ago
- ☆473Updated 2 years ago
- Automatic image captioning model based on Caffe, using features from bottom-up attention.☆245Updated last year
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"☆531Updated last year
- deep learning, image retrieval, vision and language☆299Updated 3 years ago
- PyTorch bottom-up attention with Detectron2☆231Updated 3 years ago
- BERT + Image Captioning☆132Updated 4 years ago
- Good News Everyone! - CVPR 2019☆128Updated 2 years ago
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".☆940Updated 2 years ago
- Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)☆465Updated 3 years ago
- Bilinear attention networks for visual question answering☆545Updated last year
- A Dataset for Grounded Video Description☆160Updated 3 years ago
- Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019☆282Updated 2 years ago
- A python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).☆562Updated 11 months ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images☆57Updated 5 years ago
- Pytorch code of for our CVPR 2018 paper "Neural Baby Talk"☆524Updated 5 years ago
- Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning☆82Updated 3 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.☆755Updated 10 months ago
- Recipe Generation from Food Images☆624Updated 5 years ago
- PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"☆496Updated 3 years ago
- Supervised Multimodal Bitransformers for Classifying Images and Text☆248Updated 3 years ago
- Multi Task Vision and Language☆804Updated 2 years ago
- MUREL (CVPR 2019), a multimodal relational reasoning module for VQA☆195Updated 4 years ago
- MAttNet: Modular Attention Network for Referring Expression Comprehension☆294Updated 2 years ago
- Code for Unsupervised Image Captioning☆216Updated last year
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆713Updated last year
- Transformer-based image captioning extension for pytorch/fairseq☆315Updated 4 years ago