limafang / agent-arxiv-daily
🎓 Automatically updates agent papers daily using GitHub Actions (updates every 12 hours)
☆29 Updated this week
Alternatives and similar repositories for agent-arxiv-daily:
Users that are interested in agent-arxiv-daily are comparing it to the libraries listed below
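- Train an LLM from scratch on a single 24 GB GPU ☆53 Updated 6 months ago
- Large language model applications: RAG, NL2SQL, chatbots, pretraining, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, Tianchi data competitions ☆61 Updated 2 months ago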
- ☆38 Updated 5 months ago
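- The role-playing project for Honor of Kings, based on InternLM2. 峡谷小狐仙 (Canyon Fox Spirit): a role-playing chatbot for the Honor of Kings domain that uses multimodal techniques to bring the hero Daji's persona into a large model. ☆23 Updated 9 months ago
- A practical guide to large language models: application practice and real-world deployment ☆68 Updated 7 months ago
- DPO training for Tongyi Qianwen (Qwen) ☆47 Updated 7 months ago
- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles. ☆86 Updated 7 months ago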
- ☆109 Updated 10 months ago
- ☆143 Updated 10 months ago
- ☆54 Updated 2 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the… ☆92 Updated 3 weeks ago
- ☆132 Updated 3 months ago
- ☆76 Updated this week
- ☆78 Updated 2 weeks ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation" ☆174 Updated 8 months ago
- ☆67 Updated last year
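- Study notes on RAG papers ☆120 Updated last month
- Full-parameter, LoRA, and QLoRA fine-tuning of Llama 3. ☆193 Updated 7 months ago
- Alibaba Tianchi: 2023 Global Intelligent Vehicle AI Challenge, Track 1: retrieval-based question answering with large AI models, baseline 80+ ☆102 Updated last year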
- Agentic RAG R1 Framework via Reinforcement Learning ☆130 Updated this week
- ☆123 Updated last year
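- A visualization tool for deeper understanding and easier debugging of RLHF training. ☆188 Updated 2 months ago
- Fine-tune large language models with the DPO algorithm; simple and easy to get started. ☆37 Updated 10 months ago
- A native Chinese benchmark for evaluating retrieval-augmented generation ☆115 Updated last year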
- ☆109 Updated 5 months ago
- something for paper agent ☆11 Updated 4 months ago
- ☆220 Updated last year
- Reinforcement Learning in LLM and NLP. ☆35 Updated 2 weeks ago
- This repository hosts the code for AUTOPLAN, published in Acta Automatica Sinica; it uses large language models to plan and execute complex tasks. ☆95 Updated 5 months ago
- ☆90 Updated last year