Code and data for "Learning Program Representations for Food Images and Cooking Recipes" (oral at CVPR 2022)
☆15Mar 30, 2022Updated 3 years ago
Alternatives and similar repositories for cookingprograms
Users that are interested in cookingprograms are comparing it to the libraries listed below
Sorting:
- This repo contains the procedural generation pipeline used to generate CrashCar101☆16Jan 14, 2024Updated 2 years ago
- Curriculum Meta-Learning for Next POI Recommendation☆18Sep 6, 2021Updated 4 years ago
- Enhance robot task understanding ability through visual semantic graph☆10May 20, 2021Updated 4 years ago
- ☆12Mar 24, 2021Updated 4 years ago
- A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts …☆13Jul 13, 2022Updated 3 years ago
- code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)☆10Jul 15, 2022Updated 3 years ago
- ☆20Jun 30, 2025Updated 8 months ago
- Multimodal grounded language dataset☆11Dec 14, 2021Updated 4 years ago
- [IROS2020] Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas☆10Mar 25, 2023Updated 2 years ago
- Official repo for Directional Self-supervised Learning for Heavy Image Augmentations [CVPR2022]☆12Jun 29, 2022Updated 3 years ago
- ☆22Aug 5, 2024Updated last year
- Code for MICCAI 2021 submission 'Self-Supervised Multi-Modal Alignment For Whole Body Medical Imaging'☆16Sep 22, 2021Updated 4 years ago
- KGML for EMNLP 2021☆10Feb 2, 2022Updated 4 years ago
- Code and data for "Inferring Rewards from Language in Context" [ACL 2022].☆16May 22, 2022Updated 3 years ago
- Materialist: Physically Based Editing Using Single-Image Inverse Rendering☆26Oct 24, 2025Updated 4 months ago
- Multi-Label Classification and Class Activation Map on Fashion MNIST☆11Mar 5, 2019Updated 7 years ago
- ☆11Sep 13, 2023Updated 2 years ago
- ☆13Dec 8, 2022Updated 3 years ago
- code for the paper "ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts" (CVPR 2022)☆10Jul 17, 2022Updated 3 years ago
- CLIP-based simple image-text matching baseline for COCO and F30K☆14Sep 16, 2021Updated 4 years ago
- Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation☆16Feb 7, 2022Updated 4 years ago
- Mapping Echo Chambers In Large Networks☆11Nov 8, 2024Updated last year
- Baseline implementations on the MuMiN dataset☆10Oct 18, 2023Updated 2 years ago
- ☆11Aug 10, 2021Updated 4 years ago
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.☆17Nov 11, 2021Updated 4 years ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆19Jul 20, 2024Updated last year
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/…☆10Jun 21, 2023Updated 2 years ago
- Code to accompany the "Implications of Topological Imbalance for Representation Learning on Biomedical Knowledge Graphs" (Briefings in B…☆19Feb 11, 2026Updated last month
- [Main EMNLP'25] LLMs do Multi-Label Classification Differently☆14Feb 28, 2026Updated 3 weeks ago
- Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"☆11Dec 16, 2020Updated 5 years ago
- ☆15Nov 3, 2022Updated 3 years ago
- CVPR2021☆12Mar 29, 2021Updated 4 years ago
- ☆11Feb 9, 2023Updated 3 years ago
- MulinforCPI: enhancing precision of compound-protein interaction prediction through novel perspectives on multi-level information integra…☆10Jun 20, 2024Updated last year
- Run CLIP inference on the ImageNet dataset and use these inferences as labels to train other models and again evaluate the trained model …☆12Jun 21, 2021Updated 4 years ago
- A PyTorch Implementation for our ECCV 2018 paper "Joint Person Segmentation and Identification in Synchronized First- and Third-person Vi…☆12Nov 20, 2019Updated 6 years ago
- [KDD 2025] The implementation of "Fine-tuning Multimodal Large Language Models for Product Bundling", KDD'25☆15Sep 20, 2025Updated 6 months ago
- ☆11Aug 7, 2020Updated 5 years ago
- Dataset for the investigation of visual semiotics, and how specific visual features and design choices can elicit specific emotions, thou…☆10Dec 13, 2023Updated 2 years ago