torralba-lab / im2recipe-Pytorch
im2recipe Pytorch implementation
☆278Updated last year
Alternatives and similar repositories for im2recipe-Pytorch:
Users that are interested in im2recipe-Pytorch are comparing it to the libraries listed below
- Code supporting the CVPR 2017 paper "Learning Cross-modal Embeddings for Cooking Recipes and Food Images"☆388Updated 5 months ago
- Retrieve recipes from foodie pictures using Deep Learning and Pytorch☆55Updated 4 years ago
- Good News Everyone! - CVPR 2019☆128Updated 2 years ago
- Vision-Language Pre-training for Image Captioning and Question Answering☆417Updated 3 years ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images☆57Updated 5 years ago
- deep learning, image retrieval, vision and language☆301Updated 3 years ago
- ☆153Updated 3 years ago
- [EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering☆173Updated 2 years ago
- PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"☆502Updated 3 years ago
- A Dataset for Grounded Video Description☆160Updated 3 years ago
- Starter code in PyTorch for the Visual Dialog challenge☆192Updated 2 years ago
- Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)☆465Updated 3 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"☆160Updated 4 years ago
- ☆476Updated 2 years ago
- PyTorch bottom-up attention with Detectron2☆233Updated 3 years ago
- Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering☆105Updated 5 years ago
- Image Caption and Text to Image papers.☆68Updated 7 years ago
- Code for "Semantic Object Accuracy for Generative Text-to-Image Synthesis" (TPAMI 2020)☆105Updated 3 years ago
- Transformer-based image captioning extension for pytorch/fairseq☆314Updated 4 years ago
- ☆54Updated 5 years ago
- Supervised Multimodal Bitransformers for Classifying Images and Text☆252Updated 3 years ago
- Code to train and evaluate the GeNeVA-GAN model for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating a…☆85Updated 2 years ago
- Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning☆82Updated 4 years ago
- The first public PyTorch implementation of Skip-Thought Vectors☆224Updated 7 years ago
- Pytorch implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural La…☆86Updated 7 years ago
- 🖼️ Attend to You: Personalized Image Captioning with Context Sequence Memory Networks. In CVPR, 2017. Expanded : Towards Personalized Im…☆206Updated 4 years ago
- Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".☆75Updated last year
- ☆189Updated 3 years ago
- MAttNet: Modular Attention Network for Referring Expression Comprehension☆293Updated 2 years ago
- Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image …☆532Updated 3 years ago