PennShenLab / MentalChat16KLinks
A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health assistance
☆11Updated 6 months ago
Alternatives and similar repositories for MentalChat16K
Users that are interested in MentalChat16K are comparing it to the libraries listed below
Sorting:
- Agent benchmark for medical diagnosis☆222Updated 8 months ago
- ☆48Updated 6 months ago
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning☆67Updated last month
- ☆31Updated 3 months ago
- [arxiv'25] MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale☆55Updated 3 weeks ago
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆208Updated 2 months ago
- ☆91Updated 6 months ago
- Repo about the MultiCaRe Dataset, with demo notebooks and details about how it was created.☆52Updated last month
- Code for the MedRAG toolkit☆419Updated 3 months ago
- ☆38Updated 3 months ago
- Clinical text summarization by adapting large language models☆149Updated last year
- A code repository that cointains all the code for finetuning some of the popular LLMs on medical data☆61Updated last year
- ☆124Updated last year
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆42Updated last week
- Code for MedCPT, a model for zero-shot biomedical information retrieval.☆200Updated last year
- Top papers related to LLM-based agent evaluation☆75Updated last week
- Clinical NLP Shared Task @ NAACL'24☆35Updated last week
- MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions, MICCAI 2025 (early accepted)☆80Updated last month
- ☆28Updated 9 months ago
- [ACL 2024 Findings] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning https://arxiv.org/abs/2311.10537☆280Updated last year
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆41Updated 4 months ago
- For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code.☆57Updated last year
- ToolUniverse is a collection of biomedical tools designed for AI agents☆203Updated last month
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆93Updated 11 months ago
- Curated papers on Large Language Models in Healthcare and Medical domain☆350Updated 3 months ago
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆71Updated 3 months ago
- Enhancing Medical Question-Answering System through Advanced Information Retrieval Strategies and Integration of GPT-3.5☆18Updated 2 months ago
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆194Updated 10 months ago
- Official repository of the MIRAGE benchmark☆165Updated 9 months ago
- ☆25Updated last year