matthewchung74 / qwen_2_5_3B_GRPO_medical_thinking
☆48Updated 5 months ago
Alternatives and similar repositories for qwen_2_5_3B_GRPO_medical_thinking
Users who are interested in qwen_2_5_3B_GRPO_medical_thinking are comparing it to the repositories listed below:
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆196Updated 11 months ago
- ChiMed-GPT is a Chinese medical large language model (LLM) built by continually training Ziya-v2 on Chinese medical data, where pre-train…☆99Updated last year
- A Toolkit for Table-based Question Answering☆113Updated last year
- Beyond the Model: Scaling Medical Capability with a Large Verifier System☆121Updated last month
- [npj digital medicine] The official code for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆72Updated 5 months ago
- A Chinese National Medical Licensing Examination dataset and large language model benchmarks☆74Updated last year
- Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.☆58Updated 11 months ago
- Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support☆160Updated 7 months ago
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆92Updated last year
- LongQLoRA: Extend Context Length of LLMs Efficiently☆166Updated last year
- [EMNLP'25] Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"☆58Updated 5 months ago
- LLM+RAG for QA☆23Updated last year
- The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data☆63Updated last year
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"☆129Updated last year
- CMB, A Comprehensive Medical Benchmark in Chinese☆209Updated 6 months ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆161Updated 2 months ago
- ☆73Updated 8 months ago
- Encourages medical LLMs to engage in deep thinking, similar to DeepSeek-R1.☆26Updated 5 months ago
- ☆57Updated last month
- ☆84Updated last year
- ☆147Updated last year
- ☆54Updated last year
- ☆205Updated 7 months ago
- Official repository of the MIRAGE benchmark☆171Updated 11 months ago
- Chinese medical multimodal large model: Large Chinese Language-and-Vision Assistant for BioMedicine☆95Updated last year
- AI Hospital: Interactive Evaluation and Collaboration of LLMs as Intern Doctors for Clinical Diagnosis☆181Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆89Updated 10 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆132Updated 11 months ago
- ☆58Updated 11 months ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆40Updated 5 months ago