torralba-lab / im2recipe-Pytorch
im2recipe Pytorch implementation
☆284Updated last year
Alternatives and similar repositories for im2recipe-Pytorch
Users that are interested in im2recipe-Pytorch are comparing it to the libraries listed below
Sorting:
- Code supporting the CVPR 2017 paper "Learning Cross-modal Embeddings for Cooking Recipes and Food Images"☆397Updated 6 months ago
- Retrieve recipes from foodie pictures using Deep Learning and Pytorch☆57Updated 4 years ago
- Recipe Generation from Food Images☆631Updated 5 years ago
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"☆535Updated 2 years ago
- PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"☆507Updated 3 years ago
- Vision-Language Pre-training for Image Captioning and Question Answering☆418Updated 3 years ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images☆58Updated 5 years ago
- ☆476Updated 2 years ago
- Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image …☆535Updated 3 years ago
- deep learning, image retrieval, vision and language☆303Updated 4 years ago
- A Dataset for Grounded Video Description☆162Updated 3 years ago
- Multi Task Vision and Language☆811Updated 3 years ago
- Video embeddings for retrieval with natural language queries☆341Updated 2 years ago
- [EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering☆178Updated 2 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"☆161Updated 5 years ago
- Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)☆465Updated 4 years ago
- [CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations☆563Updated last year
- Visual Q&A reading list☆437Updated 6 years ago
- ☆154Updated 3 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆288Updated 2 years ago
- Pytorch code of for our CVPR 2018 paper "Neural Baby Talk"☆525Updated 6 years ago
- Automatic image captioning model based on Caffe, using features from bottom-up attention.☆245Updated 2 years ago
- ☆189Updated 3 years ago
- Mixture-of-Embeddings-Experts☆119Updated 4 years ago
- MUREL (CVPR 2019), a multimodal relational reasoning module for VQA☆195Updated 5 years ago
- Code for the HowTo100M paper☆269Updated 5 years ago
- A python wrapper for the Visual Genome API☆363Updated last year
- Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆232Updated 3 years ago
- A lightweight, scalable, and general framework for visual question answering research☆323Updated 3 years ago
- PyTorch bottom-up attention with Detectron2☆233Updated 3 years ago