matthewchung74 / qwen_2_5_3B_GRPO_medical_thinking
☆47Updated 9 months ago
Alternatives and similar repositories for qwen_2_5_3B_GRPO_medical_thinking
Users that are interested in qwen_2_5_3B_GRPO_medical_thinking are comparing it to the libraries listed below
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆75Updated 8 months ago
- ChiMed-GPT is a Chinese medical large language model (LLM) built by continually training Ziya-v2 on Chinese medical data, where pre-train…☆104Updated 2 years ago
- A Toolkit for Table-based Question Answering☆115Updated 2 years ago
- Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.☆81Updated 3 months ago
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆198Updated last year
- A Chinese National Medical Licensing Examination dataset and large language model benchmarks☆83Updated 2 years ago
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆26Updated 9 months ago
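- The simplest reproduction of R1 results on a small model, illustrating the most important essence of O1-like models and DeepSeek R1. Think is all you need. Experiments support the claim that, for strong reasoning ability, the process content of thinking is the core of AGI/ASI.☆45Updated 11 months ago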
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆162Updated 6 months ago
- [EMNLP'25] Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"☆66Updated 9 months ago
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆94Updated last year
- ☆54Updated last year
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆41Updated 8 months ago
- Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support☆174Updated 10 months ago
- ☆84Updated 2 years ago
- The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data☆64Updated 2 years ago
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"☆132Updated last year
- CMB, A Comprehensive Medical Benchmark in Chinese☆229Updated 10 months ago
- Beyond the Model: Scaling Medical Capability with a Large Verifier System☆196Updated 4 months ago
- LLM+RAG for QA☆22Updated 2 years ago
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆105Updated 8 months ago
- [ACL Oral 2025] The official GitHub repository for TC-RAG (Turing-Complete RAG)☆74Updated 11 months ago
- ☆214Updated 11 months ago
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆66Updated 8 months ago
- A data synthesis tool that simply and efficiently synthesizes large-model training data for different business scenarios☆38Updated last year
- ☆58Updated last year
- LongQLoRA: Extend Context Length of LLMs Efficiently☆168Updated 2 years ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆64Updated last month
- ☆235Updated last year
- ☆147Updated last year