PennShenLab / MentalChat16KLinks
A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health assistance
☆16Updated 9 months ago
Alternatives and similar repositories for MentalChat16K
Users that are interested in MentalChat16K are comparing it to the libraries listed below
Sorting:
- ☆37Updated 6 months ago
- ☆48Updated 9 months ago
- Agent benchmark for medical diagnosis☆265Updated 11 months ago
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆47Updated 8 months ago
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning☆103Updated last month
- A code repository that cointains all the code for finetuning some of the popular LLMs on medical data☆68Updated last year
- A virtual clinical environment for self‑evolving LLM diagnostic agents.☆85Updated 2 weeks ago
- MIRIAD is a million-scale Medical Instruction and Retrieval Datatset☆135Updated 3 weeks ago
- MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions, MICCAI 2025 (oral and early accepted)☆104Updated 3 weeks ago
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆245Updated 6 months ago
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆53Updated 3 months ago
- [arxiv'25] MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale☆70Updated 4 months ago
- ☆38Updated 6 months ago
- Enhancing Medical Question-Answering System through Advanced Information Retrieval Strategies☆24Updated 2 months ago
- ☆39Updated 11 months ago
- [ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆51Updated last year
- Top papers related to LLM-based agent evaluation☆86Updated 2 months ago
- Repo about the MultiCaRe Dataset, with demo notebooks and details about how it was created.☆67Updated last month
- MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents☆184Updated last month
- ☆94Updated 10 months ago
- Official codes for EMNLP 2024 paper "Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models"☆37Updated last year
- [ACL 2024 Findings] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning https://arxiv.org/abs/2311.10537☆303Updated last year
- Clinical text summarization by adapting large language models☆150Updated last year
- ☆129Updated last year
- Large language model of Medical AI, General Medical AI (GMAI)☆17Updated last year
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆38Updated 6 months ago
- Code for MedCPT, a model for zero-shot biomedical information retrieval.☆221Updated last year
- [ICML'25] MedTok: Multimodal Medical Code Tokenizer☆32Updated 5 months ago
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆67Updated 2 months ago
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆94Updated last year