Tongyi-Zhiwen / Qwen-Doc
☆498 Updated 3 weeks ago
Alternatives and similar repositories for Qwen-Doc
Users who are interested in Qwen-Doc are comparing it to the repositories listed below
- MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning. ☆247 Updated 4 months ago
- ☆178 Updated 8 months ago
- Parallel Scaling Law for Language Model: Beyond Parameter and Inference Time Scaling ☆467 Updated 7 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents ☆299 Updated 2 months ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI. ☆239 Updated 7 months ago
- Scaling RL on advanced reasoning models ☆654 Updated 2 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆263 Updated 6 months ago
- ☆816 Updated 7 months ago
- ☆320 Updated last year
- Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI. ☆250 Updated 3 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux (long-CoT), ReasonFlux-PRM (process reward model) and ReasonFlux-Coder (code generation) ☆513 Updated 3 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability" ☆147 Updated 7 months ago
- ☆84 Updated 9 months ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation ☆118 Updated 7 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning ☆281 Updated 3 months ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression" ☆544 Updated 2 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents ☆530 Updated last month
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. ☆222 Updated 5 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ☆680 Updated 2 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow. ☆845 Updated 5 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data ☆191 Updated 3 weeks ago
- Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research ☆503 Updated last week
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. ☆511 Updated 4 months ago
- AN O1 REPLICATION FOR CODING ☆336 Updated last year
- ☆93 Updated 7 months ago
- Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud. ☆394 Updated 2 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction ☆566 Updated 8 months ago
- ☆195 Updated 2 weeks ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (… ☆464 Updated this week
- A toolkit on knowledge distillation for large language models ☆232 Updated 2 weeks ago