Tongyi-Zhiwen / Qwen-DocLinks
☆517Updated last month
Alternatives and similar repositories for Qwen-Doc
Users that are interested in Qwen-Doc are comparing it to the libraries listed below
Sorting:
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆469Updated 8 months ago
- ☆179Updated 8 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆304Updated 3 months ago
- ☆83Updated 9 months ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆238Updated 8 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆543Updated 2 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆147Updated 8 months ago
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆120Updated 8 months ago
- ☆207Updated this week
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆283Updated 4 months ago
- ☆320Updated last year
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆871Updated 5 months ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆553Updated 2 months ago
- Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI.☆253Updated 3 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆222Updated 6 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆264Updated 6 months ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆225Updated 5 months ago
- Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research☆524Updated this week
- ☆92Updated 8 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux (long-CoT), ReasonFlux-PRM (process reward model) and ReasonFlux-Coder (code generation)☆515Updated 4 months ago
- [ICLR 2026] Efficient Agent Training for Computer Use☆135Updated 4 months ago
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆330Updated 7 months ago
- ☆814Updated 7 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆384Updated 5 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆566Updated 8 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆196Updated last month
- Scaling RL on advanced reasoning models☆661Updated 3 months ago
- A construction kit for reinforcement learning environment management.☆319Updated this week
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆190Updated 6 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆496Updated this week