QwenLM / QwQLinks

QwQ is the reasoning model series developed by Qwen team, Alibaba Cloud.

☆527

Alternatives and similar repositories for QwQ

Users that are interested in QwQ are comparing it to the libraries listed below

Sorting:

MiroMindAI / MiroThinker
MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.
☆704Updated this week
Simple-Efficient / RL-Factory
Train your Agent model via our easy and efficient framework
☆1,622Updated this week
Elvin-Yiming-Du / Survey_Memory_in_AI
This repository introduce a comprehensive paper list, datasets, methods and tools for memory research.
☆314Updated 5 months ago
InternLM / InternBootcamp
☆323Updated 2 months ago
Tencent / CognitiveKernel-Pro
Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414
☆459Updated last month
ai-agents-2030 / awesome-deep-research-agent
☆505Updated 2 months ago
yfzhang114 / Thyme
Think Beyond Images
☆516Updated 2 months ago
Qihoo360 / 360-LLaMA-Factory
adds Sequence Parallelism into LLaMA-Factory
☆591Updated last month
inclusionAI / UI-Venus
UI-Venus is a native UI agent designed to perform precise GUI element grounding and effective navigation using only screenshots as input.
☆498Updated 2 months ago
moxin-org / Moxin-LLM
Moxin is a family of fully open-source and reproducible LLMs
☆615Updated 4 months ago
pat-jj / DeepRetrieval
[COLM’25] DeepRetrieval — 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning
☆671Updated last month
ChenmienTan / RL2
☆918Updated this week
uclaml / SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
☆582Updated 10 months ago
MilkThink-Lab / RouterEval
A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models
☆97Updated last week
MiroMindAI / MiroFlow
MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, Brow…
☆852Updated last week
ByteDance-Seed / Seed-Thinking-v1.5
☆817Updated 5 months ago
wangyu-ustc / MemoryLLM
The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM…
☆255Updated 3 months ago
cmriat / l0
A scalable, end-to-end training pipeline for general-purpose agents
☆361Updated 4 months ago
SkyworkAI / Skywork-R1V
Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.
☆2,949Updated this week
ChenxinAn-fdu / POLARIS
Scaling RL on advanced reasoning models
☆632Updated last month
Tongyi-Zhiwen / Qwen-Doc
☆301Updated 5 months ago
inclusionAI / ASearcher
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
☆492Updated last month
QwenLM / ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
☆451Updated 6 months ago
inclusionAI / Ling
Ling is a MoE LLM provided and open-sourced by InclusionAI.
☆233Updated 6 months ago
KodCode-AI / kodcode
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork
☆289Updated 2 months ago
langfengQ / verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆1,179Updated last month
dhcode-cpp / X-R1
minimal-cost for training 0.5B R1-Zero
☆785Updated 6 months ago
Ledzy / BAdam
[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
☆275Updated 8 months ago
Alpha-Innovator / InternAgent
When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification
☆786Updated last month
Tencent / llm.hunyuan.T1
☆85Updated 7 months ago