CogStack / OpenGPT
A framework for creating grounded instruction based datasets and training conversational domain expert Large Language Models (LLMs).
☆331Updated last year
Related projects: ⓘ
- Code and data for MedQA☆195Updated last year
- LLM finetuned for medical question answering☆474Updated last year
- Code for the MedRAG toolkit☆162Updated this week
- ☆203Updated 3 months ago
- A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address realworld medical entrance exam questions.☆164Updated last year
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records.☆87Updated 11 months ago
- Code for MedCPT, a model for zero-shot biomedical information retrieval.☆124Updated 5 months ago
- The official codes for "PMC-LLaMA: Towards Building Open-source Language Models for Medicine"☆577Updated 2 months ago
- ☆375Updated last year
- The official codes for "Towards Building Multilingual Language Model for Medicine"☆155Updated 2 months ago
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆157Updated last week
- Medical Graph RAG: Graph RAG for the Medical Data☆135Updated this week
- Clinical text summarization by adapting large language models☆111Updated last month
- A curated list of popular Datasets, Models and Papers for LLMs in Medical/Healthcare☆143Updated 3 months ago
- Curated papers on Large Language Models in Healthcare and Medical domain☆198Updated last month
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning☆295Updated last year
- The paper list of the review on LLMs in medicine - "Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assis…☆191Updated 8 months ago
- Agent benchmark for medical diagnosis☆80Updated last week
- Official repository of the MIRAGE benchmark☆82Updated last month
- Implementation of ChatGPT, but tailored towards primary care medicine, with the reward being able to collect patient histories in a thoro…☆313Updated 11 months ago
- PubMedQA: A Dataset for Biomedical Research Question Answering☆238Updated last year
- A novel medical large language model family with 13/70B parameters, which have SOTA performances on various medical tasks☆100Updated 2 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆456Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆362Updated 7 months ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆859Updated 4 months ago
- Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214)☆316Updated this week
- ☆598Updated last year
- Generate textbook-quality synthetic LLM pretraining data☆479Updated 11 months ago
- ☆51Updated last year
- Forward-Looking Active REtrieval-augmented generation (FLARE)☆575Updated 10 months ago