bytedance / SandboxFusion
☆760Updated 4 months ago
Alternatives and similar repositories for SandboxFusion
Users that are interested in SandboxFusion are comparing it to the libraries listed below
- Build, evaluate and train General Multi-Agent Assistance with ease☆1,019Updated this week
- ☆847Updated 2 months ago
- ☆748Updated 2 months ago
- ☆817Updated 5 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆657Updated last month
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆560Updated 6 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆792Updated 3 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆404Updated last week
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving☆282Updated 2 weeks ago
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆265Updated 9 months ago
- Awesome List for Agentic RL☆542Updated 2 weeks ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆233Updated 6 months ago
- ☆231Updated 3 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆653Updated 3 months ago
- Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"☆107Updated 6 months ago
- A flexible and efficient training framework for large-scale alignment tasks☆439Updated last month
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆881Updated 4 months ago
- ☆382Updated last month
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning☆1,246Updated 6 months ago
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆3,015Updated last week
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆618Updated 8 months ago
- The evaluation benchmark on MCP servers☆225Updated 2 months ago
- A series of technical reports on Slow Thinking with LLMs☆747Updated 3 months ago
- ☆1,229Updated last week
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆730Updated 5 months ago
- A version of verl to support diverse tool use☆701Updated this week
- ☆423Updated last month
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching☆1,199Updated 3 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆497Updated last month
- Inference code of Lingma SWE-GPT☆251Updated 11 months ago