limafang / agent-arxiv-dailyLinks
🎓Automatically Update agent Papers Daily using Github Actions (Update Every 12th hours)每日更新agent相关论文(已附带中文摘要翻译)
☆31Updated this week
Alternatives and similar repositories for agent-arxiv-daily
Users that are interested in agent-arxiv-daily are comparing it to the libraries listed below
Sorting:
- ☆46Updated 8 months ago
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆74Updated 11 months ago
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆99Updated last year
- 通义千问的DPO训练☆61Updated last year
- 使用单个24G显卡,从0开始训练LLM☆56Updated 6 months ago
- ☆64Updated 8 months ago
- ☆125Updated last year
- The Role Playing Project of Honor-of-Kings Based on LnternLM2。峡谷小狐仙--王者荣耀领域的角色扮演聊天机器人,结合多模态技术将英雄妲己的形象带入大模型中。☆27Updated last year
- ☆161Updated 11 months ago
- A small open source 3D agent simulator based on LLM.☆68Updated last year
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆278Updated 10 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆196Updated last year
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆182Updated 6 months ago
- ☆268Updated last year
- ☆115Updated last year
- OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards☆117Updated this week
- ☆234Updated last year
- ☆101Updated 2 months ago
- ☆85Updated 11 months ago
- something for paper agent☆11Updated last year
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆168Updated 9 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆94Updated last month
- A live reading list for LLM data synthesis (Updated to July, 2025).☆435Updated 4 months ago
- Reinforcement Learning in LLM and NLP.☆62Updated last week
- llm & rl☆266Updated 2 months ago
- ☆404Updated 2 months ago
- Awesome papers for role-playing with language models☆215Updated last year
- ☆121Updated 2 years ago
- ☆119Updated last year
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆159Updated 2 weeks ago