matthewchung74 / qwen_2_5_3B_GRPO_medical_thinking
☆31Updated 3 weeks ago
Alternatives and similar repositories for qwen_2_5_3B_GRPO_medical_thinking:
Users who are interested in qwen_2_5_3B_GRPO_medical_thinking are comparing it to the repositories listed below
- A Toolkit for Table-based Question Answering☆110Updated last year
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆162Updated last year
- ☆53Updated 5 months ago
- LLM+RAG for QA☆21Updated last year
- [SIGIR'24] The official implementation code of MOELoRA.☆154Updated 8 months ago
- First-place (top-1) solution for the Tianchi algorithm competition "BetterMixture - Large Model Data Mixing Challenge"☆27Updated 8 months ago
- ChiMed-GPT is a Chinese medical large language model (LLM) built by continually training Ziya-v2 on Chinese medical data, where pre-train…☆89Updated last year
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆115Updated 4 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆43Updated 3 months ago
- A Chinese National Medical Licensing Examination dataset and large language model benchmarks☆60Updated last year
- ☆23Updated 5 months ago
- Train an LLM from scratch on a single 24 GB GPU☆50Updated 5 months ago
- ☆40Updated 7 months ago
- LongQLoRA: Extend Context Length of LLMs Efficiently☆164Updated last year
- Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 9 months ago
- ☆142Updated 9 months ago
- ☆66Updated last year
- ☆18Updated 3 weeks ago
- ☆94Updated 3 months ago
- ☆81Updated last year
- Build a simple, basic multimodal large model from scratch. 🤖☆34Updated 9 months ago
- ☆216Updated 11 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆71Updated last week
- ☆60Updated 2 months ago
- ☆51Updated 6 months ago
- ☆45Updated 9 months ago
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"☆124Updated 8 months ago
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆62Updated last month
- Makes the RAG pipeline work better on table data☆41Updated 5 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆28Updated 2 weeks ago