Tongyi-Zhiwen / Qwen-Doc
☆299Updated 5 months ago
Alternatives and similar repositories for Qwen-Doc
Users interested in Qwen-Doc are comparing it to the repositories listed below
- ☆85Updated 7 months ago
- ☆172Updated 6 months ago
- ☆90Updated 5 months ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆117Updated 5 months ago
- MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.☆239Updated 2 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆263Updated 4 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆147Updated 5 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆220Updated 3 months ago
- ☆320Updated last year
- Efficient Agent Training for Computer Use☆132Updated 2 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆280Updated 3 weeks ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆176Updated 4 months ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆231Updated 5 months ago
- ☆159Updated this week
- Data Synthesis for Deep Research Based on Semi-Structured Data☆176Updated 3 weeks ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆450Updated 5 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆138Updated last year
- Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI.☆217Updated last month
- ☆73Updated 5 months ago
- [COLM 2025] An Open Math Pre-training Dataset with 370B Tokens.☆106Updated 7 months ago
- Deep Reasoning Translation (DRT) Project☆236Updated 2 months ago
- Mixture-of-Experts (MoE) Language Model☆191Updated last year
- Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs☆193Updated last month
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆62Updated 4 months ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆450Updated this week
- ☆95Updated 11 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆192Updated last week
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆89Updated 5 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆483Updated last month
- AN O1 REPLICATION FOR CODING☆336Updated 10 months ago