limafang / agent-arxiv-daily
🎓 Automatically updates agent papers daily using GitHub Actions (updates every 12 hours). Daily updates of agent-related papers, with Chinese translations of the abstracts included.
☆31Updated this week
Alternatives and similar repositories for agent-arxiv-daily
Users who are interested in agent-arxiv-daily are comparing it to the libraries listed below
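- Large language model applications: RAG, NL2SQL, chatbots, pre-training, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, Tianchi data competitions☆74Updated 11 months ago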
- ☆46Updated 8 months ago
- DPO training for Tongyi Qianwen (Qwen)☆60Updated last year
- A repo showcasing the use of MCTS with LLMs to solve GSM8K problems☆94Updated 2 months ago
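- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.☆99Updated last year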
- ☆162Updated last year
- llm & rl☆268Updated 3 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the…☆184Updated 6 months ago
- Reinforcement Learning in LLM and NLP.☆62Updated last month
- Train an LLM from scratch using a single 24 GB GPU☆56Updated 6 months ago
- ☆235Updated last year
- ☆421Updated 3 months ago
- A practical guide to large language models: application practice and real-world deployment☆86Updated last year
- A small open source 3D agent simulator based on LLM.☆69Updated last year
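- A role-playing project for Honor of Kings based on InternLM2. 峡谷小狐仙: a role-playing chatbot for the Honor of Kings domain that uses multimodal techniques to bring the likeness of the hero Daji into a large model.☆27Updated last year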
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆281Updated 11 months ago
- ☆103Updated 3 months ago
- Agentic RAG R1 Framework via Reinforcement Learning☆374Updated 2 weeks ago
- ☆489Updated 3 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆312Updated 3 weeks ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆172Updated 10 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆198Updated last year
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆677Updated 5 months ago
- ☆121Updated 2 years ago
- ☆63Updated 8 months ago
- Custom reward development on top of verl☆144Updated 8 months ago
- ☆125Updated last year
- LLaMA Factory Document☆164Updated this week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆691Updated 3 months ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆166Updated last month