Multimodal Instruction Tuning for Llama 3
☆52Apr 25, 2024Updated 2 years ago
Alternatives and similar repositories for llama-multimodal-vqa
Users that are interested in llama-multimodal-vqa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of ImageBind-LLM and Whisper-LLM from the paper "Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Compre…☆21Oct 30, 2023Updated 2 years ago
- Local self-attention in Transformer for visual question answering☆13Mar 17, 2024Updated 2 years ago
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering☆10Nov 27, 2022Updated 3 years ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆13Oct 14, 2024Updated last year
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆20Jan 9, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [TMM 2024] Official Implementation for User Identity Linkage Across Social Media via Attentive Time-Aware User Modeling☆11Apr 8, 2026Updated 3 weeks ago
- Conversational agents for engineering simulations with minimal human input using Microsoft AutoGen & GPT-4o.☆41Aug 4, 2024Updated last year
- 《大语言模型》综述全书学习笔记☆12Aug 2, 2024Updated last year
- Official implementation for the MM'22 paper.☆14Jun 30, 2022Updated 3 years ago
- Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering [ACM MM'24]☆10Jul 22, 2024Updated last year
- genES-MDA is a generic Python open-source software package to solve inverse problems via the Ensemble Smoother with Multiple Data Assimil…☆12Mar 9, 2026Updated last month
- ☆19May 31, 2023Updated 2 years ago
- Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation☆19Aug 26, 2023Updated 2 years ago
- Source code of Venus-MAXWELL: Efficient Learning of Protein-Mutation Stability Landscapes using Protein Language Models☆24Jun 3, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆18Mar 23, 2022Updated 4 years ago
- Convexified Convolutional Neural Networks☆15Dec 13, 2016Updated 9 years ago
- Github repo for Peifeng's internship project☆13Nov 7, 2023Updated 2 years ago
- The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering☆20May 10, 2022Updated 3 years ago
- ☆17Feb 27, 2026Updated 2 months ago
- Code for the paper "Molecule Design by Latent Space Energy-based Modeling and Gradual Distribution Shifting" in UAI 2023☆15Nov 15, 2023Updated 2 years ago
- A curated collection of cutting-edge research at the intersection of machine learning and healthcare. This repository will be actively ma…☆34Mar 1, 2026Updated 2 months ago
- A collection of AWESOME things about LLM-Centric-Molecular-Discovery.☆26May 20, 2025Updated 11 months ago
- [ICML 2025] Efficiently Serving Large Multimodal Models Using EPD Disaggregation☆24May 29, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository contains recent background materials, current works, and codes for researching in TPP.☆16Sep 22, 2023Updated 2 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models".☆28Sep 15, 2025Updated 7 months ago
- ☆10Jul 5, 2023Updated 2 years ago
- ☆17May 25, 2023Updated 2 years ago
- [ICANN 2024 (Oral)] MISS: A Generative Pre-training and Fine-tuning Approach for Med-VQA☆12Aug 8, 2024Updated last year
- ☆76Mar 13, 2026Updated last month
- Code for paper: Are Large Language Models Post Hoc Explainers?☆34Jul 22, 2024Updated last year
- ☆17Jun 15, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is the official code repository for the paper 'Cross-modality Data Augmentation for End-to-End Sign Language Translation'. Accepted…☆16Oct 18, 2023Updated 2 years ago
- ☆73Nov 14, 2024Updated last year
- 面向可信执行环境的OS。☆12May 9, 2025Updated 11 months ago
- Implementation of CycleGAN for Text style transfer with PyTorch.☆32Sep 8, 2019Updated 6 years ago
- ☆12Mar 5, 2025Updated last year
- The code based on vLLM for the paper “ Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Sep 19, 2024Updated last year
- Autonomous Traversal and Object Detection for Rovers☆16Apr 27, 2026Updated last week