FreedomIntelligence / HuatuoGPT-Vision
Medical Multimodal LLMs
☆226Updated last week
Alternatives and similar repositories for HuatuoGPT-Vision:
Users who are interested in HuatuoGPT-Vision are comparing it to the repositories listed below.
- The official codes for "PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents"☆199Updated 4 months ago
- An interpretable large language model (LLM) for medical diagnosis.☆106Updated 4 months ago
- ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World☆71Updated 5 months ago
- [ICLR 2024] FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-…☆84Updated 2 weeks ago
- Learning to Use Medical Tools with Multi-modal Agent☆100Updated last week
- PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modal…☆188Updated last month
- LeFusion: Controllable Pathology Synthesis via Lesion-Focused Diffusion Models☆62Updated last month
- u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model☆126Updated 6 months ago
- HuatuoGPT2, One-stage Training for Medical Adaption of LLMs. (An Open Medical GPT)☆344Updated 4 months ago
- ☆192Updated this week
- Code for paper: From redundancy to relevance: Enhancing explainability in multimodal large language models☆59Updated this week
- The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data☆58Updated last year
- [arXiv'24 & NeurIPSW'24] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models☆85Updated last month
- Code for AAAI 2024 paper: Relax Image-Specific Prompt Requirement in SAM: A Single Generic Prompt for Segmenting Camouflaged Objects☆136Updated 3 months ago
- (ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator☆101Updated 3 months ago
- [NeurIPS'24] Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation☆52Updated last month
- (AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions☆250Updated 9 months ago
- Chinese medical multimodal large model: Large Chinese Language-and-Vision Assistant for BioMedicine☆65Updated 7 months ago
- Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models☆157Updated 2 months ago
- [EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models"☆76Updated 3 weeks ago
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models☆84Updated 9 months ago
- [EMNLP 2023] FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models☆81Updated last year
- The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆48Updated 2 months ago
- ☆51Updated last month
- MedLSAM: Localize and Segment Anything Model for 3D Medical Images☆479Updated 8 months ago
- Dataset of paper: On the Compositional Generalization of Multimodal LLMs for Medical Imaging☆27Updated 2 weeks ago
- [ACM MM 2024] SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification☆32Updated last month
- Official implementation of "Towards Efficient Visual Adaption via Structural Re-parameterization".☆177Updated 9 months ago
- [ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models☆162Updated 6 months ago
- [EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models☆56Updated last month