KMnO4-zx / paper-agent
something for paper agent
☆11Updated 5 months ago
Alternatives and similar repositories for paper-agent
Users who are interested in paper-agent are comparing it to the libraries listed below
- 🎓Automatically Update Agent Papers Daily using GitHub Actions (Update Every 12 Hours)☆30Updated this week
- ☆22Updated 3 months ago
- MLLM @ Game☆14Updated 3 weeks ago
- DPO training for Tongyi Qianwen (Qwen)☆48Updated 8 months ago
- Train a LLaVA model with better Chinese support, with open-source training code and data.☆59Updated 9 months ago
- Pretrain, decay, and SFT a CodeLLM from scratch 🧙‍♂️☆35Updated last year
- ☆40Updated 3 weeks ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the…☆99Updated 3 weeks ago
- ☆121Updated last week
- Agentic Workflow - Daily Track on Arxiv.org Paper☆44Updated 3 months ago
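- A role-playing project for Honor of Kings based on InternLM2. 峡谷小狐仙 (Canyon Little Fox Fairy): a role-playing chatbot for the Honor of Kings domain that uses multimodal techniques to bring the image of the hero Daji into a large model.☆24Updated 10 months ago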
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆73Updated this week
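- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.☆87Updated 8 months ago
- A repo showcasing the use of MCTS with LLMs to solve GSM8K problems☆82Updated 2 months ago
- Address extraction for regions using multiple agents☆30Updated last week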
- ☆63Updated 6 months ago
- ☆18Updated last month
- ☆60Updated 2 weeks ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark☆24Updated last week
- ☆81Updated this week
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆179Updated 2 months ago
- llm & rl☆134Updated last week
- Awesome Agent Training☆131Updated this week
- ☆60Updated last year
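- This project uses the PaddlePaddle platform to build an innovative multi-model collaborative system for efficient, accurate conversion of PDF files to Markdown files.☆12Updated 2 months ago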
- Music large model based on InternLM2-chat.☆22Updated 5 months ago
- ☆77Updated 2 months ago
- ☆52Updated 8 months ago
- Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.☆26Updated 2 months ago