QwenLM / Qwen3GuardLinks
Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.
☆313Updated this week
Alternatives and similar repositories for Qwen3Guard
Users that are interested in Qwen3Guard are comparing it to the libraries listed below
Sorting:
- ☆298Updated 4 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆277Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆346Updated 4 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆225Updated last week
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆459Updated last month
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆433Updated 2 months ago
- ☆174Updated last month
- Prompt-to-Leaderboard☆260Updated 5 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆185Updated 3 weeks ago
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆453Updated last month
- Code and data for the Chain-of-Draft (CoD) paper☆332Updated 7 months ago
- Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)☆175Updated 4 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆174Updated last week
- The evaluation benchmark on MCP servers☆218Updated last month
- Beating the GAIA benchmark with Transformers Agents. 🚀☆138Updated 8 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆464Updated 2 weeks ago
- ☆232Updated 3 months ago
- Tina: Tiny Reasoning Models via LoRA☆299Updated last month
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆246Updated 5 months ago
- ☆84Updated 6 months ago
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.☆451Updated 3 weeks ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆263Updated last month
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆554Updated 5 months ago
- MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.☆467Updated last week
- Scaling RL on advanced reasoning models☆614Updated 2 months ago
- LIMI: Less is More for Agency☆141Updated last week
- ☆89Updated 5 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆734Updated 2 months ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆321Updated last week
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆429Updated last week