openmedlab / PULSE-EVAL
PULSE-EVAL
☆15Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for PULSE-EVAL
- CMB, A Comprehensive Medical Benchmark in Chinese☆134Updated 7 months ago
- Biomedical LLM, A Bilingual (Chinese and English) Fine-Tuned Large Language Model for Diverse Biomedical Tasks☆141Updated last month
- The Largest-scale Chinese Medical QA Dataset: with 26,000,000 question answer pairs.☆224Updated 8 months ago
- A Chinese medical ChatGPT based on LLaMa, training from large-scale pretrain corpus and multi-turn dialogue dataset.☆317Updated 11 months ago
- PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese☆325Updated 9 months ago
- A curated list of popular Datasets, Models and Papers for LLMs in Medical/Healthcare☆171Updated 5 months ago
- 用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.☆242Updated last year
- Deepspeed、LLM、Medical_Dialogue、医疗大模型、预训练、微调☆231Updated 5 months ago
- 各大顶会医疗领域NLP论文与资源。NLP papers and resources in the medical field.☆104Updated last year
- ☆91Updated 11 months ago
- A Chinese National Medical Licensing Examination dataset and large languge model benchmarks☆49Updated 11 months ago
- ☆15Updated 6 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated 7 months ago
- This is the repo of the medical dialogue dataset 'imcs21' in CBLUE@Tianchi☆82Updated last year
- ☆35Updated 5 months ago
- Code and dataset for our Bioinformatics 2022 paper: "A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datase…☆55Updated last year
- ChiMed-GPT is a Chinese medical large language model (LLM) built by continually training Ziya-v2 on Chinese medical data, where pre-train…☆74Updated 10 months ago
- Python ROUGE Score Implementation for Chinese Language Task (official rouge score)☆82Updated 4 months ago
- ☆157Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆72Updated 4 months ago
- ☆158Updated last year
- ☆120Updated 7 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆67Updated last week
- llm-medical-data:用于大模型微调训练的医疗数据集☆70Updated last year
- Viscacha:通用信息抽取数据集收集☆25Updated 9 months ago
- RAGOnMedicalKG,将大模型RAG与KG结合,完成demo级问答,旨在给出基础的思路。☆206Updated 7 months ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆99Updated last year
- 🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调,我们的眼光不止于医 疗问答☆301Updated last year
- llama2 finetuning with deepspeed and lora☆167Updated last year
- 中文大语言模型评测第二期☆70Updated last year