limafang / agent-arxiv-daily
🎓 Automatically update agent papers daily using GitHub Actions (updates every 12 hours)
☆30Updated this week
Alternatives and similar repositories for agent-arxiv-daily
Users that are interested in agent-arxiv-daily are comparing it to the libraries listed below
- ☆40Updated 3 weeks ago
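- A role-playing project for Honor of Kings based on InternLM2. 峡谷小狐仙 (Canyon Little Fox Fairy): a role-playing chatbot for the Honor of Kings domain that uses multimodal techniques to bring the image of the hero Daji into a large language model.☆24Updated 10 months ago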
- Reinforcement Learning in LLM and NLP.☆36Updated 3 weeks ago
- Train an LLM from scratch on a single 24 GB GPU☆54Updated last week
- ☆140Updated 4 months ago
- DPO training for Tongyi Qianwen (Qwen)☆48Updated 8 months ago
- A practical guide to large language models: applied practice and real-world deployment☆71Updated 8 months ago
- Agentic RAG R1 Framework via Reinforcement Learning☆191Updated last week
- A small open source 3D agent simulator based on LLM.☆65Updated 6 months ago
- ☆83Updated last month
- A repo showcasing the use of MCTS with LLMs to solve GSM8K problems☆82Updated 2 months ago
- ☆81Updated this week
- Train your grpo with zero dataset and low resources, 8bit/4bit/lora/qlora supported, multi-gpu supported ...☆71Updated last month
- ☆69Updated last year
- ☆110Updated 11 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the…☆99Updated 3 weeks ago
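- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.☆87Updated 8 months ago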
- ☆123Updated last year
- ☆91Updated last year
- As the name suggests: a hand-built RAG☆123Updated last year
- ☆142Updated 11 months ago
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆203Updated 3 months ago
- ☆63Updated last month
- ☆221Updated last year
- Alibaba Tianchi: 2023 Global Intelligent Vehicle AI Challenge, Track 1: AI large-model retrieval QA, baseline 80+☆105Updated last year
- ☆108Updated 6 months ago
- ☆92Updated 2 months ago
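- A Chinese-native evaluation benchmark for retrieval-augmented generation☆117Updated last year
- Large language model applications: RAG, NL2SQL, chatbots, pre-training, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, Tianchi data competitions☆62Updated 3 months ago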
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆160Updated last year