amzn / image-to-recipe-transformersView external linksLinks
Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
☆88Mar 24, 2021Updated 4 years ago
Alternatives and similar repositories for image-to-recipe-transformers
Users that are interested in image-to-recipe-transformers are comparing it to the libraries listed below
Sorting:
- Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrie…☆88Jan 10, 2023Updated 3 years ago
- Multi-Modal Transformer for Video Retrieval☆265Oct 9, 2024Updated last year
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images☆30Jun 14, 2019Updated 6 years ago
- Transductive Zero-Shot Hashing For Multi-Label Image Retrieval☆18Jan 18, 2021Updated 5 years ago
- 📝 Recipe-related papers in NLP (e.g., ACL, EMNLP), CV (e.g., CVPR, ECCV), IR (e.g., SIGIR, RecSys), and HCI (e.g., CHI)☆34May 27, 2025Updated 8 months ago
- Cross-Modal Center Loss for 3D Cross-Modal Retrieval (CVPR2021)☆35Apr 4, 2021Updated 4 years ago
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.☆25Oct 2, 2021Updated 4 years ago
- Python implementation of cross-modal hashing algorithms☆22Nov 17, 2022Updated 3 years ago
- Joint-modal Distribution-based Similarity Hashing for Large-scale Unsupervised Deep Cross-modal Retrieval☆25Dec 10, 2020Updated 5 years ago
- Generalized Product Quantization Network For Semi-supervised Image Retrieval - CVPR 2020☆63May 27, 2024Updated last year
- The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretr…☆443Sep 25, 2025Updated 4 months ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 5 years ago
- SIGIR paper Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback☆14Oct 17, 2022Updated 3 years ago
- Deep Clustering and Block Hashing Network for Face Image Retrieval - ACCV 2018☆13Nov 5, 2021Updated 4 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆89Jul 7, 2021Updated 4 years ago
- [CVPR 2021] Generative Hierarchical Features from Synthesizing Images☆158Apr 13, 2021Updated 4 years ago
- Hyperbolic Visual Embedding Learning for Zero-Shot Recognition (CVPR 2020)☆82Jul 6, 2023Updated 2 years ago
- Video embeddings for retrieval with natural language queries☆342Feb 15, 2023Updated 3 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
- ☆131Dec 10, 2022Updated 3 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- Siamese graph convolutional network for content based remote sensing image retrieval☆14Sep 13, 2021Updated 4 years ago
- A toolbox to explore synchronous layerwise-parallel deep neural networks.☆17Jul 29, 2019Updated 6 years ago
- A simple demo of distributed training in Pytorch☆36Sep 30, 2019Updated 6 years ago
- (ECCV 2020) This repo contains code for "DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning" (https://arxiv.org/abs/2004.1…☆36Dec 5, 2021Updated 4 years ago
- We deal with the problem of zero-shot cross-modal image retrieval involving color and sketch images through a novel deep representation l…☆14Sep 13, 2021Updated 4 years ago
- Source code of our TCSVT 2017 paper "SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval"☆15May 29, 2019Updated 6 years ago
- Use CLIP to represent video for Retrieval Task☆70Mar 1, 2021Updated 4 years ago
- Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval☆70Jan 27, 2020Updated 6 years ago
- The official codes for paper "Deep hash learning for remote sensing image retrieval"☆21Nov 16, 2020Updated 5 years ago
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆40Jun 26, 2024Updated last year
- Extension of Self-Supervised Temporal Hashing☆14Apr 15, 2021Updated 4 years ago
- Codes for our CVPR 2021 paper "Deep Compositional Metric Learning"☆21Aug 23, 2021Updated 4 years ago
- ☆16Mar 15, 2021Updated 4 years ago
- ☆15Mar 20, 2020Updated 5 years ago
- ☆42Apr 25, 2021Updated 4 years ago
- A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.☆48Nov 15, 2021Updated 4 years ago
- PyTorch code for "SOLAR: Second-Order Loss and Attention for Image Retrieval". In ECCV 2020☆176May 28, 2021Updated 4 years ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆723Aug 8, 2023Updated 2 years ago