limafang / agent-arxiv-daily
🎓 Automatically updates agent papers daily using GitHub Actions (runs every 12 hours). Daily updates of agent-related papers, with Chinese translations of the abstracts included.
☆31Updated this week
Alternatives and similar repositories for agent-arxiv-daily
Users that are interested in agent-arxiv-daily are comparing it to the libraries listed below
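- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.☆95Updated last year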
- ☆42Updated 4 months ago
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆256Updated 7 months ago
- ☆87Updated 4 months ago
- ☆160Updated 8 months ago
- ☆231Updated last year
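- Large language model applications: RAG, NL2SQL, chatbots, pre-training, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, and Tianchi data competitions☆69Updated 7 months ago
- Train an LLM from scratch on a single 24 GB GPU☆55Updated 2 months ago
- DPO training for Tongyi Qianwen (Qwen)☆55Updated last year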
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the…☆160Updated 3 months ago
- An Awesome List of Agentic Models Trained with Reinforcement Learning☆489Updated 2 weeks ago
- A small open source 3D agent simulator based on LLM.☆67Updated 10 months ago
- A live reading list for LLM data synthesis (Updated to July, 2025).☆378Updated last month
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆284Updated 4 months ago
- ☆356Updated 3 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆358Updated last week
- llm & rl☆222Updated 2 weeks ago
- ☆84Updated 8 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆158Updated 6 months ago
- Reinforcement Learning in LLM and NLP.☆61Updated 3 weeks ago
- Custom reward development on verl☆118Updated 4 months ago
- This is the repository for the Tool Learning survey.☆438Updated last month
- ☆125Updated last year
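- A role-playing project for Honor of Kings based on InternLM2. 峡谷小狐仙 (Little Fox Fairy of the Canyon): a role-playing chatbot for the Honor of Kings domain that uses multimodal techniques to bring the hero Daji into a large model.☆25Updated last year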
- ☆408Updated last month
- ☆354Updated 2 months ago
- a-m-team's exploration in large language modeling☆188Updated 4 months ago
- ☆115Updated 10 months ago
- AN O1 REPLICATION FOR CODING☆335Updated 9 months ago
- A reproduction of open-r1 that applies GRPO training to Qwen models at 0.5B, 1.5B, 3B, and 7B, with some interesting phenomena observed.☆46Updated 5 months ago