Official Repo for FoodieQA paper (EMNLP 2024)
☆20Jun 26, 2025Updated 9 months ago
Alternatives and similar repositories for FoodieQA
Users that are interested in FoodieQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of the paper: "What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Vision-Language Models." …☆10Mar 7, 2025Updated last year
- [ACL 2024] Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue☆26Oct 18, 2025Updated 6 months ago
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- ☆11Mar 29, 2021Updated 5 years ago
- Code of Graph Contrastive Partial Multi-View Clustering☆12Mar 10, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code of UEAF, AAAI,2019.☆13Jul 28, 2023Updated 2 years ago
- This is the CODE of Reciprocal Multi-Layer Subspace Learning for Multi-View Clustering☆15Nov 4, 2020Updated 5 years ago
- ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation. AAAI, 2025☆13Aug 25, 2025Updated 7 months ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ☆17Jan 12, 2024Updated 2 years ago
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".☆27Jun 10, 2025Updated 10 months ago
- ☆29Oct 8, 2024Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26May 14, 2024Updated last year
- ☆10Oct 31, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆45Jun 30, 2024Updated last year
- LLM as World Models using Bayesian inference☆17May 27, 2025Updated 10 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Oct 27, 2024Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆23Jul 1, 2024Updated last year
- [ACL'25 Main] Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs☆41May 26, 2025Updated 10 months ago
- ☆29Sep 15, 2020Updated 5 years ago
- ☆13Nov 5, 2024Updated last year
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling☆13May 5, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system☆151Updated this week
- Cross-modal Hierarchical Modelling for FGSBIR. Work accepted for Oral presentation in BMVC 2020☆18Sep 8, 2023Updated 2 years ago
- Incomplete Multi-view Clustering via Graph Regularized Matrix Factorization☆20Feb 7, 2022Updated 4 years ago
- a benchmark to evaluate the situated inductive reasoning☆15Jan 7, 2025Updated last year
- Localized Sparse Incomplete Multi-view Clustering☆25May 17, 2023Updated 2 years ago
- ☆13Oct 21, 2021Updated 4 years ago
- ☆36Aug 25, 2022Updated 3 years ago
- Crack Detection Based on Infrared thermography (IR)☆17Jun 26, 2024Updated last year
- ☆30Jan 8, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)☆13Apr 17, 2024Updated 2 years ago
- Source Code & Datasets for "Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data"☆12May 20, 2022Updated 3 years ago
- ICML2024: Equivariant Graph Neural Operator for Modeling 3D Dynamics☆61Mar 27, 2024Updated 2 years ago
- MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with Intent-Slot Co-Attention (EMNLP 2023 - Findings)☆33Jul 22, 2024Updated last year
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆21Aug 1, 2025Updated 8 months ago
- A free tool that helps you transcribe, translate, and summarize videos in any language.☆18Feb 27, 2024Updated 2 years ago
- ☆16Sep 8, 2021Updated 4 years ago