TheAgentArk / Toucan
Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
☆223Updated last month
Alternatives and similar repositories for Toucan
Users who are interested in Toucan are comparing it to the repositories listed below.
- MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.☆382Updated 2 weeks ago
- Deep Research☆303Updated 5 months ago
- ☆239Updated last week
- ☆57Updated 4 months ago
- Scaling Long-Horizon LLM Agent via Context-Folding☆106Updated 2 weeks ago
- Towards a Unified View of Large Language Model Post-Training☆200Updated 5 months ago
- ☆169Updated 3 weeks ago
- Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reaso…☆16Updated 7 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆314Updated last month
- Extrapolating RLVR to General Domains without Verifiers☆200Updated 5 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆392Updated 5 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆64Updated 3 months ago
- [FSE'2026] SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆144Updated 2 weeks ago
- Research works from Tencent AI Lab regarding self-evolving agents☆82Updated last week
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆148Updated 8 months ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆27Updated 8 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆62Updated 4 months ago
- Test-time preference optimization (ICML 2025).☆178Updated 9 months ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆251Updated 4 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆182Updated 6 months ago
- ☆219Updated 8 months ago
- Omni Model Benchmark with high quality and diversity, which reveals the Compositional Law. We’re now focused on Chinese scenarios — and a…☆74Updated 3 weeks ago
- ☆179Updated 2 months ago
- REverse-Engineered Reasoning for Open-Ended Generation☆91Updated 5 months ago
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"☆101Updated 4 months ago
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆47Updated last week
- 🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal rei…☆196Updated 2 months ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agents.☆229Updated 5 months ago
- ☆72Updated 8 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆71Updated 8 months ago