torralba-lab / im2recipe-Pytorch
im2recipe Pytorch implementation
☆271 Updated 8 months ago
Related projects
Alternatives and complementary repositories for im2recipe-Pytorch
- Code supporting the CVPR 2017 paper "Learning Cross-modal Embeddings for Cooking Recipes and Food Images" ☆377 Updated 3 weeks ago
- Retrieve recipes from foodie pictures using Deep Learning and PyTorch ☆52 Updated 3 years ago
- Vision-Language Pre-training for Image Captioning and Question Answering ☆412 Updated 2 years ago
- Code for the HowTo100M paper ☆252 Updated 4 years ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images ☆57 Updated 5 years ago
- A Dataset for Grounded Video Description ☆159 Updated 2 years ago
- Supervised Multimodal Bitransformers for Classifying Images and Text ☆245 Updated 3 years ago
- PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives" ☆491 Updated 2 years ago
- [EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering ☆172 Updated 2 years ago
- ☆146 Updated 2 years ago
- Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image … ☆519 Updated 3 years ago
- Recipe Generation from Food Images ☆620 Updated 5 years ago
- Video embeddings for retrieval with natural language queries ☆336 Updated last year
- Mixture-of-Embeddings-Experts ☆118 Updated 4 years ago
- Fashion 200K dataset used in paper "Automatic Spatially-aware Fashion Concept Discovery." ☆63 Updated 2 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference" ☆159 Updated 4 years ago
- Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is an… ☆257 Updated 2 years ago
- PyTorch bottom-up attention with Detectron2 ☆230 Updated 2 years ago
- ☆188 Updated 3 years ago
- Learning to align and match videos with kernelized temporal layers ☆138 Updated 3 years ago
- deep learning, image retrieval, vision and language ☆300 Updated 3 years ago
- TVSum: Title-based Video Summarization dataset (CVPR 2015) ☆121 Updated 5 years ago
- Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019) ☆466 Updated 3 years ago
- PyTorch implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural La… ☆86 Updated 7 years ago
- Starter code in PyTorch for the Visual Dialog challenge ☆192 Updated last year
- Multi Task Vision and Language ☆800 Updated 2 years ago
- Automatic image captioning model based on Caffe, using features from bottom-up attention. ☆244 Updated last year
- Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning ☆83 Updated 3 years ago
- Visual Question Answering project with state-of-the-art single-model performance. ☆132 Updated 6 years ago
- PyTorch code for our CVPR 2018 paper "Neural Baby Talk" ☆524 Updated 5 years ago