matthewchung74 / qwen_2_5_3B_GRPO_medical_thinkingLinks
☆46Updated 2 months ago
Alternatives and similar repositories for qwen_2_5_3B_GRPO_medical_thinking
Users that are interested in qwen_2_5_3B_GRPO_medical_thinking are comparing it to the libraries listed below
Sorting:
- A Chinese National Medical Licensing Examination dataset and large languge model benchmarks☆66Updated last year
- A Toolkit for Table-based Question Answering☆112Updated last year
- The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data☆61Updated last year
- ChiMed-GPT is a Chinese medical large language model (LLM) built by continually training Ziya-v2 on Chinese medical data, where pre-train…☆94Updated last year
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆160Updated last year
- ☆82Updated last year
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆88Updated 8 months ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆23Updated last month
- LLM+RAG for QA☆22Updated last year
- Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support☆133Updated 3 months ago
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆67Updated last month
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"☆50Updated 2 months ago
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆25Updated 2 months ago
- ☆66Updated 5 months ago
- ☆42Updated 4 months ago
- ☆142Updated 11 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆81Updated 7 months ago
- ☆57Updated 8 months ago
- CMB, A Comprehensive Medical Benchmark in Chinese☆200Updated 2 months ago
- ☆53Updated 9 months ago
- LAiW: A Chinese Legal Large Language Models Benchmark☆80Updated 11 months ago
- 基于baichuan-7b的开源多模态大语言模型☆73Updated last year
- KDD 2024 AQA competition 2nd place solution☆11Updated 11 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.☆41Updated 8 months ago
- ☆36Updated 2 months ago
- ☆36Updated 9 months ago
- [EMNLP 2023 Demo] "CLEVA: Chinese Language Models EVAluation Platform" [ACL 2025 Findings] "C2LEVA: Toward Comprehensive and Contaminatio…☆63Updated last month
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 4 months ago