Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
☆88Mar 24, 2021Updated 4 years ago
Alternatives and similar repositories for image-to-recipe-transformers
Users that are interested in image-to-recipe-transformers are comparing it to the libraries listed below
Sorting:
- Official implementation of VLPCook: Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval☆15Mar 25, 2023Updated 2 years ago
- [CVPRW22] Official Implementation of T-Food: "Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval". Accept…☆34Jul 8, 2022Updated 3 years ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images☆58Jun 14, 2019Updated 6 years ago
- Multi-Modal Transformer for Video Retrieval☆265Oct 9, 2024Updated last year
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images☆30Jun 14, 2019Updated 6 years ago
- Transductive Zero-Shot Hashing For Multi-Label Image Retrieval☆18Jan 18, 2021Updated 5 years ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆14Oct 22, 2024Updated last year
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.☆25Oct 2, 2021Updated 4 years ago
- Python implementation of cross-modal hashing algorithms☆22Nov 17, 2022Updated 3 years ago
- Generalized Product Quantization Network For Semi-supervised Image Retrieval - CVPR 2020☆63May 27, 2024Updated last year
- The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretr…☆443Sep 25, 2025Updated 5 months ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- A Master Thesis Project on Video Keyword Extractor using Video Summarization techniques.☆11Oct 25, 2020Updated 5 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆89Jul 7, 2021Updated 4 years ago
- Hyperbolic Visual Embedding Learning for Zero-Shot Recognition (CVPR 2020)☆81Jul 6, 2023Updated 2 years ago
- Video embeddings for retrieval with natural language queries☆342Feb 15, 2023Updated 3 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- A toolbox to explore synchronous layerwise-parallel deep neural networks.☆17Jul 29, 2019Updated 6 years ago
- (ECCV 2020) This repo contains code for "DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning" (https://arxiv.org/abs/2004.1…☆36Dec 5, 2021Updated 4 years ago
- We deal with the problem of zero-shot cross-modal image retrieval involving color and sketch images through a novel deep representation l…☆14Sep 13, 2021Updated 4 years ago
- Ever felt tired after preprocessing the dataset, and not wanting to write any code further to train your model? Ever encountered a situat…☆19May 8, 2021Updated 4 years ago
- Source code of our TCSVT 2017 paper "SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval"☆15May 29, 2019Updated 6 years ago
- Use CLIP to represent video for Retrieval Task☆70Mar 1, 2021Updated 5 years ago
- Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval☆70Jan 27, 2020Updated 6 years ago
- The official codes for paper "Deep hash learning for remote sensing image retrieval"☆21Nov 16, 2020Updated 5 years ago
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆40Jun 26, 2024Updated last year
- Codes for our CVPR 2021 paper "Deep Compositional Metric Learning"☆21Aug 23, 2021Updated 4 years ago
- ☆16Mar 15, 2021Updated 4 years ago
- Extension of Self-Supervised Temporal Hashing☆14Apr 15, 2021Updated 4 years ago
- Implementation of accepted AAAI 2021 paper: Deep Unsupervised Image Hashing by Maximizing Bit Entropy☆84Dec 2, 2021Updated 4 years ago
- Deep learning cross modal hashing in PyTorch☆109Oct 7, 2021Updated 4 years ago
- ☆15Mar 20, 2020Updated 5 years ago
- ☆42Apr 25, 2021Updated 4 years ago
- Support extracting BUTD features for NLVR2 images.☆18Sep 5, 2020Updated 5 years ago
- Implementation of TC-Net for iSBIR: Triplet Classification Network for instance-level Sketch Based Image Retrieval.☆21Feb 23, 2020Updated 6 years ago
- PyTorch code for "SOLAR: Second-Order Loss and Attention for Image Retrieval". In ECCV 2020☆176May 28, 2021Updated 4 years ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆729Aug 8, 2023Updated 2 years ago
- EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important tempo…☆23Mar 8, 2024Updated 2 years ago