KMnO4-zx / paper-agentLinks
something for paper agent
☆11Updated 10 months ago
Alternatives and similar repositories for paper-agent
Users that are interested in paper-agent are comparing it to the libraries listed below
Sorting:
- 🎓Automatically Update agent Papers Daily using Github Actions (Update Every 12th hours)每日更新agent相关论文(已附带中文摘要翻译)☆31Updated this week
- ☆27Updated 4 months ago
- MLLM @ Game☆14Updated 6 months ago
- ☆23Updated 7 months ago
- ☆33Updated 4 months ago
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆97Updated last year
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆222Updated 3 months ago
- ☆65Updated 11 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆91Updated this week
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking☆39Updated 10 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]☆194Updated 3 months ago
- 通义千问的DPO训练☆58Updated last year
- ☆13Updated last year
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆76Updated last year
- P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark☆42Updated 5 months ago
- Pretrain、decay、SFT a CodeLLM from scratch 🧙♂️☆39Updated last year
- llm & rl☆243Updated 3 weeks ago
- AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics…☆218Updated last year
- 用大模型批量处理数据,现支持各种大模型做OCR,支持通义千问, 月之暗面, 百度飞桨OCR, OpenAI 和LLAVA。Use LLM to generate or clean data for academic use. Support OCR with qwen, m…☆15Updated last year
- ☆91Updated 2 weeks ago
- ☆26Updated last year
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆88Updated 9 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆398Updated this week
- 项目的issue会存放我的所有blog☆15Updated 2 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆168Updated 4 months ago
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆71Updated 9 months ago
- Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.☆28Updated 8 months ago
- ☆44Updated 6 months ago
- ☆115Updated last year
- ☆125Updated 3 weeks ago