Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
☆89 · Mar 24, 2021 · Updated 5 years ago
Alternatives and similar repositories for image-to-recipe-transformers
Users that are interested in image-to-recipe-transformers are comparing it to the libraries listed below.
- Official implementation of VLPCook: Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval ☆16 · Mar 25, 2023 · Updated 3 years ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images ☆58 · Jun 14, 2019 · Updated 6 years ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images ☆30 · Jun 14, 2019 · Updated 6 years ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective ☆15 · Oct 22, 2024 · Updated last year
- im2recipe Pytorch implementation ☆298 · Mar 18, 2024 · Updated 2 years ago
- Joint-modal Distribution-based Similarity Hashing for Large-scale Unsupervised Deep Cross-modal Retrieval ☆25 · Dec 10, 2020 · Updated 5 years ago
- Cross-Modal Center Loss for 3D Cross-Modal Retrieval (CVPR2021) ☆34 · Apr 4, 2021 · Updated 5 years ago
- 📝 Recipe-related papers in NLP (e.g., ACL, EMNLP), CV (e.g., CVPR, ECCV), IR (e.g., SIGIR, RecSys), and HCI (e.g., CHI) ☆33 · May 27, 2025 · Updated last year
- Code and data for "Learning Program Representations for Food Images and Cooking Recipes" (oral at CVPR 2022) ☆15 · Mar 30, 2022 · Updated 4 years ago
- Transductive Zero-Shot Hashing For Multi-Label Image Retrieval ☆18 · Jan 18, 2021 · Updated 5 years ago
- The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretr… ☆445 · Sep 25, 2025 · Updated 8 months ago
- Cross-Modal-Hashing-Retrieval/Multi-Modal-Hashing-Retrieval ☆26 · Jun 20, 2023 · Updated 2 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning ☆291 · Sep 6, 2022 · Updated 3 years ago
- The code for Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval (ICCV 2019) ☆86 · Nov 27, 2019 · Updated 6 years ago
- We deal with the problem of zero-shot cross-modal image retrieval involving color and sketch images through a novel deep representation l… ☆14 · Sep 13, 2021 · Updated 4 years ago
- Retrieve recipes from foodie pictures using Deep Learning and Pytorch ☆59 · Feb 22, 2021 · Updated 5 years ago
- Generalized Product Quantization Network For Semi-supervised Image Retrieval - CVPR 2020 ☆63 · May 27, 2024 · Updated 2 years ago
- Siamese graph convolutional network for content based remote sensing image retrieval ☆14 · Sep 13, 2021 · Updated 4 years ago
- Deep learning cross modal hashing in PyTorch ☆109 · Oct 7, 2021 · Updated 4 years ago
- The official codes for paper "Deep hash learning for remote sensing image retrieval" ☆21 · Nov 16, 2020 · Updated 5 years ago
- Deep Metric and Hash Code Learning Network for Content Based Retrieval of Remote Sensing Images ☆38 · Mar 1, 2020 · Updated 6 years ago
- Source Code for Online Collective Matrix Factorization Hashing. Reference: Di Wang, Quan Wang, Yaqiang An, Xinbo Gao, and Yumin Tian. 202… ☆11 · Oct 20, 2020 · Updated 5 years ago
- Video embeddings for retrieval with natural language queries ☆344 · Feb 15, 2023 · Updated 3 years ago
- A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning. ☆48 · Nov 15, 2021 · Updated 4 years ago
- Hyperbolic Visual Embedding Learning for Zero-Shot Recognition (CVPR 2020) ☆82 · Jul 6, 2023 · Updated 2 years ago
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o… ☆40 · Jun 26, 2024 · Updated last year
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation" ☆23 · Mar 4, 2020 · Updated 6 years ago
- BATCH: A Scalable Asymmetric Discrete Cross-Modal Hashing ☆12 · Sep 22, 2025 · Updated 8 months ago
- [CVPR 2021] Generative Hierarchical Features from Synthesizing Images ☆158 · Apr 13, 2021 · Updated 5 years ago
- Source code for paper "Supervised Discrete Hashing" on CVPR-2015 ☆21 · Jan 1, 2020 · Updated 6 years ago
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning" ☆11 · Oct 11, 2024 · Updated last year
- Code and benchmarks for the Semantic Video Retrieval Task ☆53 · Oct 18, 2022 · Updated 3 years ago
- PyTorch code for "SOLAR: Second-Order Loss and Attention for Image Retrieval". In ECCV 2020 ☆176 · May 28, 2021 · Updated 5 years ago
- Implementation of accepted AAAI 2021 paper: Deep Unsupervised Image Hashing by Maximizing Bit Entropy ☆84 · Dec 2, 2021 · Updated 4 years ago
- (ECCV 2020) This repo contains code for "DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning" (https://arxiv.org/abs/2004.1… ☆36 · Dec 5, 2021 · Updated 4 years ago
- Dense Regression Network for Video Grounding (CVPR2020) ☆53 · Jan 28, 2021 · Updated 5 years ago
- ☆42 · Apr 25, 2021 · Updated 5 years ago
- Source code for paper "Discrete Latent Factor Model for Cross-Modal Hashing" ☆18 · Aug 21, 2020 · Updated 5 years ago
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos ☆38 · Jan 29, 2025 · Updated last year