torralba-lab / im2recipe-Pytorch
im2recipe Pytorch implementation
☆282 Updated last year
Alternatives and similar repositories for im2recipe-Pytorch:
Users interested in im2recipe-Pytorch are comparing it to the libraries listed below.
- Code supporting the CVPR 2017 paper "Learning Cross-modal Embeddings for Cooking Recipes and Food Images" ☆394 Updated 5 months ago
- Retrieve recipes from foodie pictures using Deep Learning and Pytorch ☆57 Updated 4 years ago
- ☆154 Updated 3 years ago
- Vision-Language Pre-training for Image Captioning and Question Answering ☆417 Updated 3 years ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images ☆58 Updated 5 years ago
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language" ☆535 Updated last year
- ☆476 Updated 2 years ago
- Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image … ☆534 Updated 3 years ago
- Fashion 200K dataset used in paper "Automatic Spatially-aware Fashion Concept Discovery." ☆64 Updated 3 years ago
- PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives" ☆504 Updated 3 years ago
- deep learning, image retrieval, vision and language ☆302 Updated 4 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning ☆288 Updated 2 years ago
- A Dataset for Grounded Video Description ☆162 Updated 3 years ago
- Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019) ☆466 Updated 3 years ago
- Multi Task Vision and Language ☆811 Updated 3 years ago
- [EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering ☆177 Updated 2 years ago
- Good News Everyone! - CVPR 2019 ☆128 Updated 3 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning" ☆792 Updated 3 years ago
- PyTorch bottom-up attention with Detectron2 ☆233 Updated 3 years ago
- Video embeddings for retrieval with natural language queries ☆341 Updated 2 years ago
- 🖼️ Attend to You: Personalized Image Captioning with Context Sequence Memory Networks. In CVPR, 2017. Expanded : Towards Personalized Im… ☆206 Updated 4 years ago
- Code for the HowTo100M paper ☆267 Updated 5 years ago
- An implementation that downstreams pre-trained V+L models to VQA tasks. Now supports: VisualBERT, LXMERT, and UNITER ☆163 Updated 2 years ago
- Automatic image captioning model based on Caffe, using features from bottom-up attention. ☆245 Updated 2 years ago
- ☆189 Updated 3 years ago
- Pytorch code for our CVPR 2018 paper "Neural Baby Talk" ☆524 Updated 6 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference" ☆161 Updated 4 years ago
- The iMaterialist Fashion Attribute Dataset ☆85 Updated 4 years ago
- Supervised Multimodal Bitransformers for Classifying Images and Text ☆252 Updated 3 years ago
- Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval ☆57 Updated 3 years ago