MoonshotAI / Kimi-ResearcherLinks
☆67Updated last month
Alternatives and similar repositories for Kimi-Researcher
Users that are interested in Kimi-Researcher are comparing it to the libraries listed below
Sorting:
- Scaling Preference Data Curation via Human-AI Synergy☆94Updated last month
- ☆70Updated this week
- ☆84Updated last week
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆87Updated 3 months ago
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆33Updated 2 months ago
- ☆79Updated last week
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆159Updated last week
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆111Updated 2 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆98Updated last week
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆38Updated 5 months ago
- ☆54Updated 2 weeks ago
- ☆77Updated 4 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆63Updated 3 months ago
- ☆90Updated 2 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆156Updated last month
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆268Updated last month
- Efficient Agent Training for Computer Use☆120Updated last month
- The code and data for the paper JiuZhang3.0☆48Updated last year
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆64Updated 2 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆136Updated last year
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆158Updated last week
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆188Updated 4 months ago
- Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.☆89Updated last month
- ☆157Updated 3 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆51Updated last month
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆105Updated 2 months ago
- ☆101Updated last month
- Official Implementation of APB (ACL 2025 main Oral)☆31Updated 5 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆118Updated last month
- [COLM 2025] An Open Math Pre-trainng Dataset with 370B Tokens.☆96Updated 4 months ago