Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
☆88Mar 24, 2021Updated 5 years ago
Alternatives and similar repositories for image-to-recipe-transformers
Users that are interested in image-to-recipe-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of VLPCook: Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval☆16Mar 25, 2023Updated 3 years ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images☆58Jun 14, 2019Updated 6 years ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images☆30Jun 14, 2019Updated 6 years ago
- Multi-Modal Transformer for Video Retrieval☆265Oct 9, 2024Updated last year
- Code & data for IJCAI'22 paper "Recipe2Vec: Multi-modal Recipe Representation Learning with Graph Neural Networks".☆14Jul 24, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Cross-Modal Center Loss for 3D Cross-Modal Retrieval (CVPR2021)☆35Apr 4, 2021Updated 4 years ago
- 📝 Recipe-related papers in NLP (e.g., ACL, EMNLP), CV (e.g., CVPR, ECCV), IR (e.g., SIGIR, RecSys), and HCI (e.g., CHI)☆34May 27, 2025Updated 10 months ago
- Code and data for "Learning Program Representations for Food Images and Cooking Recipes" (oral at CVPR 2022)☆15Mar 30, 2022Updated 4 years ago
- Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrie…☆88Jan 10, 2023Updated 3 years ago
- Transductive Zero-Shot Hashing For Multi-Label Image Retrieval☆18Jan 18, 2021Updated 5 years ago
- Python implementation of cross-modal hashing algorithms☆22Nov 17, 2022Updated 3 years ago
- Cross-Modal-Hashing-Retrieval/Multi-Modal-Hashing-Retrieval☆26Jun 20, 2023Updated 2 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆89Jul 7, 2021Updated 4 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- The code for Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval (ICCV 2019)☆85Nov 27, 2019Updated 6 years ago
- We deal with the problem of zero-shot cross-modal image retrieval involving color and sketch images through a novel deep representation l…☆14Sep 13, 2021Updated 4 years ago
- Retrieve recipes from foodie pictures using Deep Learning and Pytorch☆59Feb 22, 2021Updated 5 years ago
- Generalized Product Quantization Network For Semi-supervised Image Retrieval - CVPR 2020☆63May 27, 2024Updated last year
- The official codes for paper "Deep hash learning for remote sensing image retrieval"☆21Nov 16, 2020Updated 5 years ago
- Deep Metric and Hash Code Learning Network for Content Based Retrieval of Remote Sensing Images☆38Mar 1, 2020Updated 6 years ago
- Source Code for Online Collective Matrix Factorization Hashing. Reference: Di Wang, Quan Wang, Yaqiang An, Xinbo Gao, and Yumin Tian. 202…☆11Oct 20, 2020Updated 5 years ago
- SIGIR paper Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback☆13Oct 17, 2022Updated 3 years ago
- Video embeddings for retrieval with natural language queries☆343Feb 15, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.☆48Nov 15, 2021Updated 4 years ago
- Hyperbolic Visual Embedding Learning for Zero-Shot Recognition (CVPR 2020)☆82Jul 6, 2023Updated 2 years ago
- ☆131Dec 10, 2022Updated 3 years ago
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.☆25Oct 2, 2021Updated 4 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- BATCH: A Scalable Asymmetric Discrete Cross-Modal Hashing☆12Sep 22, 2025Updated 6 months ago
- [CVPR 2021] Generative Hierarchical Features from Synthesizing Images☆158Apr 13, 2021Updated 4 years ago
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- PyTorch code for "SOLAR: Second-Order Loss and Attention for Image Retrieval". In ECCV 2020☆176May 28, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching☆170Oct 12, 2020Updated 5 years ago
- Implementation of accepted AAAI 2021 paper: Deep Unsupervised Image Hashing by Maximizing Bit Entropy☆84Dec 2, 2021Updated 4 years ago
- (ECCV 2020) This repo contains code for "DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning" (https://arxiv.org/abs/2004.1…☆36Dec 5, 2021Updated 4 years ago
- Adaptive Cross-Modal Embeddings for Image-Sentence Alignment☆36Oct 3, 2023Updated 2 years ago
- Placeholder for code of BSP.☆11Aug 13, 2021Updated 4 years ago
- ☆24May 31, 2022Updated 3 years ago
- Implementation for "Multilevel Language and Vision Integration for Text-to-Clip Retrieval"☆49Jan 21, 2019Updated 7 years ago