QwenLM / QwQ
QwQ is the reasoning model series developed by the Qwen team, Alibaba Cloud.
☆527 Updated 7 months ago
Alternatives and similar repositories for QwQ
Users who are interested in QwQ are comparing it to the libraries listed below.
- ☆320 Updated 2 months ago
- Train your Agent model via our easy and efficient framework ☆1,587 Updated this week
- MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios. ☆500 Updated last week
- The official implementation of Self-Play Preference Optimization (SPPO) ☆581 Updated 9 months ago
- ☆817 Updated 4 months ago
- adds Sequence Parallelism into LLaMA-Factory ☆582 Updated 2 weeks ago
- ☆905 Updated this week
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414 ☆413 Updated 2 weeks ago
- This repository introduces a comprehensive paper list, datasets, methods and tools for memory research. ☆309 Updated 4 months ago
- Moxin is a family of fully open-source and reproducible LLMs ☆613 Updated 3 months ago
- Think Beyond Images ☆509 Updated last month
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆449Updated 5 months ago
- The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM… ☆245 Updated 3 months ago
- ☆492 Updated last month
- UI-Venus is a native UI agent designed to perform precise GUI element grounding and effective navigation using only screenshots as input. ☆493 Updated 2 months ago
- [COLM’25] DeepRetrieval — 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning☆661Updated 2 weeks ago
- A scalable, end-to-end training pipeline for general-purpose agents ☆360 Updated 3 months ago
- 🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets ☆276 Updated this week
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning. ☆2,943 Updated 2 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. ☆473 Updated last month
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆574 Updated 4 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents ☆471 Updated 3 weeks ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in… ☆1,081 Updated last week
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models ☆272 Updated 7 months ago
- ☆299 Updated 5 months ago
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions, all in one framework ☆282 Updated last month
- Ling is a MoE LLM provided and open-sourced by InclusionAI. ☆227 Updated 5 months ago
- Scaling RL on advanced reasoning models ☆620 Updated last week
- Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities ☆1,083 Updated 3 months ago
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling". ☆273 Updated 8 months ago