ljc010717 / GRPO2025Links
☆23Updated 7 months ago
Alternatives and similar repositories for GRPO2025
Users that are interested in GRPO2025 are comparing it to the libraries listed below
Sorting:
- A live reading list for LLM data synthesis (Updated to July, 2025).☆420Updated 3 months ago
- 对llama3进行全参微调、lora微调以及qlora微调。☆211Updated last year
- RAG 论文学习☆180Updated 8 months ago
- llm & rl☆258Updated last month
- kaggle 2024 Eedi 第10名 金牌方案☆43Updated 11 months ago
- ☆175Updated last year
- ☆105Updated 6 months ago
- 该仓库主要记录 LLMs 算法工程师相关的顶会论文研读笔记(多模态、PEFT、小样本QA问答、RAG、LMMs可解释性、Agents、CoT)☆370Updated last year
- 在verl上做reward的定制开发☆132Updated 6 months ago
- ☆32Updated last year
- A Survey on Multimodal Retrieval-Augmented Generation☆438Updated last month
- This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)☆68Updated 6 months ago
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge☆129Updated last year
- Reinforcement Learning in LLM and NLP.☆61Updated 3 months ago
- 快速入门RAG与私有化部署☆211Updated last year
- ☆60Updated last year
- 《EasyOffer》(<大模型面经合集>)是针对LLM宝宝们量身打造的大模型暑期实习Offer指南,主要记录大模型暑期实习和秋招准备的一些常见大厂手撕代码、大厂面经经验、常见大厂思考题等;小白一个,正在学习ing......有问题各位大佬随时指正,希望大家都能拿到心仪Of…☆592Updated 8 months ago
- Awesome List for Agentic RL☆585Updated this week
- personal chatgpt☆396Updated 11 months ago
- ☆55Updated 6 months ago
- ☆551Updated 11 months ago
- ☆64Updated 7 months ago
- This is the repository for the Tool Learning survey.☆461Updated 4 months ago
- A One-Stop Reward Model Platform☆101Updated this week
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆74Updated 10 months ago
- ☆269Updated last year
- ☆85Updated 10 months ago
- 大模型进阶面经☆87Updated 7 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆409Updated 5 months ago
- awesome LLM papers! 🚀 🚀 🚀☆32Updated 5 months ago