TheAgentArk / ToucanLinks
Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
☆194Updated 3 weeks ago
Alternatives and similar repositories for Toucan
Users that are interested in Toucan are comparing it to the libraries listed below
Sorting:
- MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.☆357Updated last week
- Deep Research☆303Updated 4 months ago
- ☆52Updated 3 months ago
- Omni Model Benchmark with high quality and diversity, which reveals the Compositional Law. We’re now focused on Chinese scenarios — and a…☆76Updated this week
- [EMNLP 2025] RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions☆137Updated 8 months ago
- Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reaso…☆15Updated 6 months ago
- [FSE'2026] SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆131Updated this week
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆132Updated 9 months ago
- ☆106Updated 3 weeks ago
- Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"☆249Updated this week
- ☆213Updated 7 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆156Updated 6 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆140Updated 7 months ago
- Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework☆163Updated 3 weeks ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆186Updated 4 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆130Updated 2 months ago
- ☆176Updated last month
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆69Updated 7 months ago
- PSFT is a trust-region–inspired fine-tuning objective that views SFT as a policy gradient method with constant advantages, constraining p…☆34Updated 4 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆254Updated 8 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆62Updated 2 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆300Updated 2 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆511Updated 4 months ago
- ✨✨Latest Papers and Datasets on Mobile and PC GUI Agent☆143Updated last year
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Updated 2 weeks ago
- ☆41Updated 4 months ago
- [R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"☆122Updated 2 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆173Updated 3 months ago
- ☆21Updated last month
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.☆176Updated 6 months ago