limafang / agent-arxiv-dailyLinks
🎓Automatically Update agent Papers Daily using Github Actions (Update Every 12th hours)每日更新agent相关论文(已附带中文摘要翻译)
☆31Updated this week
Alternatives and similar repositories for agent-arxiv-daily
Users that are interested in agent-arxiv-daily are comparing it to the libraries listed below
Sorting:
- ☆45Updated 7 months ago
- 通义千问的DPO训练☆60Updated last year
- ☆98Updated last month
- Official code for Dynamic Parametric RAG.☆166Updated 4 months ago
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆74Updated 10 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆178Updated 5 months ago
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆98Updated last year
- A small open source 3D agent simulator based on LLM.☆67Updated last year
- ☆162Updated 11 months ago
- ☆268Updated last year
- ☆64Updated 7 months ago
- [2025-上海人工智能实验室书生实训营十佳、优秀项目]☆40Updated 2 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆195Updated last year
- ☆124Updated 2 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆93Updated last month
- Fine-Tuning Dataset Auto-Generation for Graph Query Languages.☆84Updated last month
- 使用单个24G显卡,从0开始训练LLM☆55Updated 5 months ago
- 本项目是自动化学报中AUTOPLAN的代码地址,使用大语言模型完成了复杂任务的长程工具调用☆113Updated 2 months ago
- Agentic RAG R1 Framework via Reinforcement Learning☆355Updated this week
- ☆85Updated 10 months ago
- Reinforcement Learning in LLM and NLP.☆61Updated 3 months ago
- ☆234Updated last year
- Zero-human, cold-start construction of long-chain agents in professional domains☆45Updated last month
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆165Updated 8 months ago
- something for paper agent☆11Updated last year
- ☆28Updated 5 months ago
- llm & rl☆261Updated last month
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆274Updated 10 months ago
- Code and data for QueryAgent(ACL 2024)☆20Updated last year
- MetaSearch:llm深度研究(deepsearch)功能方案实现☆34Updated 4 months ago