KMnO4-zx / paper-agentLinks
something for paper agent
☆11Updated 7 months ago
Alternatives and similar repositories for paper-agent
Users that are interested in paper-agent are comparing it to the libraries listed below
Sorting:
- 🎓Automatically Update agent Papers Daily using Github Actions (Update Every 12th hours)每日更新agent相关论文(已附带中文摘要翻译)☆31Updated this week
- MLLM @ Game☆14Updated 2 months ago
- ☆23Updated 3 weeks ago
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆91Updated 10 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆85Updated 4 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆64Updated 10 months ago
- 通义千问的DPO训练☆50Updated 10 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆197Updated this week
- ☆30Updated 3 weeks ago
- A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models☆51Updated 4 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆164Updated 2 weeks ago
- ☆12Updated last year
- ☆64Updated 8 months ago
- ☆41Updated 2 months ago
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆74Updated 5 months ago
- ☆171Updated last month
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆65Updated 5 months ago
- ☆89Updated last month
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆138Updated 3 months ago
- P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark☆35Updated last month
- llm & rl☆172Updated this week
- AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics…☆201Updated 11 months ago
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆287Updated 3 weeks ago
- ☆27Updated 9 months ago
- The official implementation of Natural Language Fine-Tuning☆50Updated 6 months ago
- ☆66Updated last month
- ☆73Updated 2 months ago
- A mini assistant to help you read paper quickly☆50Updated 2 months ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆206Updated this week
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.☆134Updated 2 weeks ago