bytedance / SandboxFusion
☆710Updated 4 months ago
Alternatives and similar repositories for SandboxFusion
Users that are interested in SandboxFusion are comparing it to the libraries listed below
- Build, evaluate and train General Multi-Agent Assistance with ease☆948Updated last week
- ☆817Updated 4 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆379Updated this week
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆556Updated 5 months ago
- Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"☆106Updated 5 months ago
- ☆749Updated 2 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆761Updated 3 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆631Updated 2 weeks ago
- ☆835Updated 2 months ago
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving☆268Updated last week
- MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, Brow…☆794Updated last week
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆227Updated 5 months ago
- A flexible and efficient training framework for large-scale alignment tasks☆436Updated last week
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆261Updated 8 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆649Updated 2 months ago
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆2,903Updated this week
- SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasonin…☆190Updated last month
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆848Updated 3 months ago
- An Awesome List of Agentic Models trained with Reinforcement Learning☆527Updated 3 weeks ago
- AN O1 REPLICATION FOR CODING☆336Updated 10 months ago
- ☆367Updated 2 weeks ago
- The evaluation benchmark on MCP servers☆220Updated 2 months ago
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching☆1,174Updated 2 months ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆610Updated 7 months ago
- A series of technical reports on Slow Thinking with LLM☆744Updated 2 months ago
- Inference code of Lingma SWE-GPT☆248Updated 11 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.☆247Updated 6 months ago
- OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next…☆235Updated last month
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆727Updated 4 months ago
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning☆1,230Updated 5 months ago