826568389 / GRPO-R1
☆11Updated last week
Alternatives and similar repositories for GRPO-R1:
Users that are interested in GRPO-R1 are comparing it to the libraries listed below
- GoGPT中文指令数据集构造☆10Updated last year
- 通用简单工具项目☆17Updated 5 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆11Updated 9 months ago
- ☆21Updated 9 months ago
- KDD 2024 AQA competition 2nd place solution☆11Updated 8 months ago
- BLOOM 模型的指令微调☆24Updated last year
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆62Updated 8 months ago
- 通义千问的DPO 训练☆40Updated 6 months ago
- LLM+RAG for QA☆22Updated last year
- 2023全球智能汽车AI挑战赛——赛道一:AI大模型检索问答, 75+ baseline☆56Updated last year
- 大语言模型训练和服务调研☆37Updated last year
- 一套代码指令微调大模型☆38Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆55Updated 10 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 11 months ago
- ☆18Updated 8 months ago
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆58Updated last month
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆54Updated last year
- 用于AIOPS24挑战赛的Demo☆61Updated 9 months ago
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆16Updated 5 months ago
- meta-comprehensive-rag-benchmark-kdd-cup-2024 phase1 task1 rank3☆17Updated 9 months ago
- ☆33Updated 3 months ago
- ☆12Updated 8 months ago
- LLM RAG 应用,支持 API 调用,语音交互。☆11Updated 9 months ago
- accelerate generating vector by using onnx model☆15Updated last year
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆27Updated 8 months ago
- ☆142Updated 8 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 9 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated 11 months ago
- 文言文信息抽取(实体识别+关系抽取)☆9Updated 2 years ago
- aigc evals☆10Updated last year