williamliujl / CMExam
A Chinese National Medical Licensing Examination dataset and large languge model benchmarks
☆60Updated last year
Alternatives and similar repositories for CMExam:
Users that are interested in CMExam are comparing it to the libraries listed below
- CMB, A Comprehensive Medical Benchmark in Chinese☆169Updated this week
- Biomedical LLM, A Bilingual (Chinese and English) Fine-Tuned Large Language Model for Diverse Biomedical Tasks☆147Updated 5 months ago
- ☆15Updated 9 months ago
- Data and baseline code of EMNLP 2021 paper "MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset".☆25Updated 3 years ago
- ☆38Updated 2 weeks ago
- ChiMed-GPT is a Chinese medical large language model (LLM) built by continually training Ziya-v2 on Chinese medical data, where pre-train…☆89Updated last year
- Code and dataset for our Bioinformatics 2022 paper: "A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datase…☆55Updated 2 years ago
- The first Chinese Multimodal Medical Knowledge Graph☆35Updated 2 years ago
- A large Chinese Medical CQA☆56Updated last year
- KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques☆38Updated 3 months ago
- A Chinese medical question answering dataset☆62Updated 5 years ago
- ☆80Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆78Updated 8 months ago
- Official repository of the MIRAGE benchmark☆121Updated 4 months ago
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆46Updated 2 months ago
- A Toolkit for Table-based Question Answering☆110Updated last year
- This is the repo of the medical dialogue dataset 'imcs21' in CBLUE@Tianchi☆83Updated 2 years ago
- Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.☆27Updated 5 months ago
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆50Updated last year
- Diaformer: Automatic Diagnosis via Symptoms Sequence Generation☆26Updated last year
- PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese☆347Updated last year
- A Chinese medical ChatGPT based on LLaMa, training from large-scale pretrain corpus and multi-turn dialogue dataset.☆342Updated last year
- ☆95Updated last year
- [ACL 2024] CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling☆96Updated last week
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆88Updated 11 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆120Updated 9 months ago
- ☆37Updated 6 months ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)☆47Updated 10 months ago
- ☆21Updated last year