matthewchung74 / qwen_2_5_3B_GRPO_medical_thinkingLinks
☆48Updated 4 months ago
Alternatives and similar repositories for qwen_2_5_3B_GRPO_medical_thinking
Users that are interested in qwen_2_5_3B_GRPO_medical_thinking are comparing it to the libraries listed below
Sorting:
- ChiMed-GPT is a Chinese medical large language model (LLM) built by continually training Ziya-v2 on Chinese medical data, where pre-train…☆99Updated last year
- Beyond the Model: Scaling Medical Capability with a Large Verifier System☆106Updated 2 weeks ago
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆71Updated 4 months ago
- Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.☆56Updated 11 months ago
- A Toolkit for Table-based Question Answering☆112Updated last year
- LLM+RAG for QA☆23Updated last year
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆38Updated 4 months ago
- ☆54Updated last year
- A Chinese National Medical Licensing Examination dataset and large languge model benchmarks☆72Updated last year
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆32Updated last year
- CMB, A Comprehensive Medical Benchmark in Chinese☆207Updated 5 months ago
- ☆83Updated last year
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆194Updated 11 months ago
- Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support☆151Updated 6 months ago
- ☆205Updated 6 months ago
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆93Updated 11 months ago
- ☆21Updated 3 months ago
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"☆128Updated last year
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆26Updated 4 months ago
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆62Updated 10 months ago
- LongQLoRA: Extent Context Length of LLMs Efficiently☆166Updated last year
- ☆58Updated 11 months ago
- 多轮共情对话模型PICA☆97Updated 2 years ago
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆43Updated 5 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆67Updated 2 years ago
- Specialized LLMs capable of handling various diabetes tasks☆49Updated 3 months ago
- AI Hospital: Interactive Evaluation and Collaboration of LLMs as Intern Doctors for Clinical Diagnosis☆179Updated last year
- The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data☆62Updated last year
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆223Updated last month
- ☆39Updated 5 months ago