agent-infra / sandbox
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.
☆141Updated this week
Alternatives and similar repositories for sandbox
Users who are interested in sandbox are comparing it to the repositories listed below.
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆162Updated last week
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆407Updated this week
- ☆179Updated last week
- ☆422Updated this week
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.☆431Updated 3 weeks ago
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents☆434Updated this week
- Official code repository for Sketch-of-Thought (SoT)☆127Updated 4 months ago
- Agent computer interface for AI software engineer.☆111Updated last week
- Data Synthesis for Deep Research Based on Semi-Structured Data☆158Updated this week
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!☆50Updated 5 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆99Updated 3 weeks ago
- MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.☆154Updated last week
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆388Updated 2 weeks ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆160Updated 3 months ago
- Agentic Web: Weaving the Next Web with AI Agents.☆361Updated last week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks☆245Updated 4 months ago
- SWE Arena☆34Updated 2 months ago
- [NAACL2025] LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications☆124Updated 2 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆260Updated last month
- AWM: Agent Workflow Memory☆325Updated 7 months ago
- Challenges for general-purpose web-browsing AI agents☆65Updated 3 months ago
- ☆81Updated 11 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆77Updated 6 months ago
- 🚀 MassGen: An Open-source Multi-Agent Scaling System Inspired by Grok Heavy and Gemini Deep Think. Join the discord channel: https://dis…☆449Updated last week
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆85Updated 2 weeks ago
- ☆164Updated 3 weeks ago
- The evaluation benchmark on MCP servers☆211Updated 3 weeks ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆321Updated this week
- The official repo for the code and data of paper SMART☆36Updated 7 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆128Updated 11 months ago