QwenLM / QwQLinks
QwQ is the reasoning model series developed by Qwen team, Alibaba Cloud.
☆516Updated 4 months ago
Alternatives and similar repositories for QwQ
Users that are interested in QwQ are comparing it to the libraries listed below
Sorting:
- Train your Agent model via our easy and efficient framework☆1,332Updated this week
- ☆668Updated this week
- This repository introduce a comprehensive paper list, datasets, methods and tools for memory research.☆255Updated 2 months ago
- Moxin is a family of fully open-source and reproducible LLMs☆606Updated last month
- adds Sequence Parallelism into LLaMA-Factory☆544Updated this week
- [COLM'25] DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning☆607Updated last month
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆702Updated this week
- A scalable, end-to-end training pipeline for general-purpose agents☆351Updated last month
- ☆802Updated 2 months ago
- ☆288Updated 2 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆428Updated 2 months ago
- minimal-cost for training 0.5B R1-Zero☆766Updated 2 months ago
- The official implementation of Self-Play Preference Optimization (SPPO)☆575Updated 6 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆265Updated 4 months ago
- The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM…☆206Updated 2 weeks ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆588Updated last week
- Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing re…☆957Updated this week
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork☆253Updated 3 weeks ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆539Updated 2 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆698Updated 2 months ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆181Updated 2 months ago
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.☆2,927Updated last week
- ✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning☆246Updated 3 months ago
- s3 - ⚡ Efficient Yet Effective Search Agent Training via RL for RAG☆508Updated last week
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆539Updated 3 months ago
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents☆374Updated 6 months ago
- ☆88Updated last week
- Recipes to train the self-rewarding reasoning LLMs.☆224Updated 5 months ago
- 🐝 The First Graph Agentic Framework with RL and Prompt Optimization☆909Updated 7 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆548Updated 3 months ago