KMnO4-zx / paper-agent
something for paper agent
☆11Updated last year
Alternatives and similar repositories for paper-agent
Users that are interested in paper-agent are comparing it to the libraries listed below
- 🎓Automatically update agent papers daily using GitHub Actions (updates every 12 hours). Daily updates of agent-related papers (Chinese abstract translations included).☆31Updated this week
- ☆35Updated last month
- MLLM @ Game☆15Updated 7 months ago
- Train a LLaVA model with better Chinese support, and open-source the training code and data.☆77Updated last year
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆93Updated last month
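- This project covers dataset synthesis, model training, and evaluation for the math problem-solving ability of large models, along with notes on related articles.☆98Updated last year
- DPO training for Tongyi Qianwen (Qwen)☆60Updated last year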
- ☆28Updated 5 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]☆198Updated 5 months ago
- ☆45Updated 7 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆222Updated 4 months ago
- A simple and well-tailored LLM application framework that enables you to seamlessly integrate LLM capabilities in the most "Code-Centric"…☆72Updated 2 weeks ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆305Updated 7 months ago
- P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark☆44Updated 6 months ago
- Pretrain, decay, SFT a CodeLLM from scratch 🧙♂️☆39Updated last year
- AN O1 REPLICATION FOR CODING☆336Updated last year
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆89Updated 10 months ago
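- Large language model applications: RAG, NL2SQL, chatbots, pre-training, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, Tianchi data competitions☆74Updated 10 months ago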
- llm & rl☆263Updated 2 months ago
- Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.☆29Updated 9 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆450Updated this week
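- As the name suggests: a hand-built RAG☆130Updated last year
- Batch-process data with large models; currently supports OCR with various large models, including Tongyi Qianwen (Qwen), Moonshot AI, Baidu PaddleOCR, OpenAI, and LLaVA. Use LLM to generate or clean data for academic use. Support OCR with qwen, m…☆15Updated last year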
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking☆39Updated 11 months ago
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge☆129Updated last year
- ☆99Updated last month
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆155Updated last week
- LLM101n: Let's build a Storyteller (Chinese edition)☆136Updated last year
- [AAAI 2026] The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants☆43Updated 2 weeks ago
- ☆65Updated last year