PennShenLab / MentalChat16K
A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health assistance
☆14Updated 8 months ago
Alternatives and similar repositories for MentalChat16K
Users who are interested in MentalChat16K are comparing it to the repositories listed below
- ☆34Updated 5 months ago
- Repo about the MultiCaRe Dataset, with demo notebooks and details about how it was created.☆61Updated last week
- ☆48Updated 8 months ago
- Agent benchmark for medical diagnosis☆253Updated 10 months ago
- Enhancing Medical Question-Answering System through Advanced Information Retrieval Strategies and Integration of GPT-3.5☆24Updated last month
- [arxiv'25] MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale☆65Updated 3 months ago
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆46Updated last month
- A code repository that contains all the code for finetuning some of the popular LLMs on medical data☆64Updated last year
- ☆94Updated 9 months ago
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆232Updated 4 months ago
- Top papers related to LLM-based agent evaluation☆86Updated 2 weeks ago
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning☆78Updated last week
- Code for the MedRAG toolkit☆457Updated 6 months ago
- [ICLR'25] Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval☆49Updated 6 months ago
- ☆128Updated last year
- [EMNLP 2024] Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models☆37Updated 10 months ago
- Official repository of the MIRAGE benchmark☆179Updated last year
- MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions, MICCAI 2025 (early accepted)☆89Updated 4 months ago
- RuleRAG: Rule Meets Retrieval-Augmented Generation for Question Answering☆27Updated last month
- Clinical NLP Shared Task @ NAACL'24☆35Updated 2 months ago
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆75Updated this week
- Clinical text summarization by adapting large language models☆149Updated last year
- [ACL 2024 Findings] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning https://arxiv.org/abs/2311.10537☆293Updated last year
- ☆38Updated 5 months ago
- ☆25Updated 7 months ago
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆197Updated last year
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆43Updated 6 months ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆111Updated 10 months ago
- MIRIAD is a million-scale Medical Instruction and RetrIeval Dataset☆128Updated 2 months ago
- ☆39Updated 9 months ago