bytedance / SandboxFusionLinks

☆710

Alternatives and similar repositories for SandboxFusion

Users that are interested in SandboxFusion are comparing it to the libraries listed below

Sorting:

inclusionAI / AWorld
Build, evaluate and train General Multi-Agent Assistance with ease
☆948Updated last week
ByteDance-Seed / Seed-Thinking-v1.5
☆817Updated 4 months ago
modelscope / Trinity-RFT
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…
☆379Updated this week
hkust-nlp / CodeIO
[ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
☆556Updated 5 months ago
bytedance / FullStackBench
Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"
☆106Updated 5 months ago
Qihoo360 / Light-R1
☆749Updated 2 months ago
BytedTsinghua-SIA / MemAgent
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
☆761Updated 3 months ago
GAIR-NLP / DeepResearcher
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆631Updated 2 weeks ago
CharlesQ9 / Alita
☆835Updated 2 months ago
multi-swe-bench / multi-swe-bench
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
☆268Updated last week
MiroMindAI / MiroFlow
MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, Brow…
☆794Updated last week
inclusionAI / Ling
Ling is a MoE LLM provided and open-sourced by InclusionAI.
☆227Updated 5 months ago
alibaba / ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
☆436Updated last week
HarderThenHarder / RLLoggingBoard
A visuailzation tool to make deep understaning and easier debugging for RLHF training.
☆261Updated 8 months ago
RUCAIBox / R1-Searcher
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
☆649Updated 2 months ago
inclusionAI / AReaL
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
☆2,903Updated this week
JARVIS-Xs / SE-Agent
SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasonin…
☆190Updated last month
0russwest0 / Agent-R1
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
☆848Updated 3 months ago
thinkwee / AgentsMeetRL
An Awesome List of Agentic Model trained with Reinforcement Learning
☆527Updated 3 weeks ago
ADaM-BJTU / O1-CODER
AN O1 REPLICATION FOR CODING
☆336Updated 10 months ago
qiancheng0 / ToolRL
☆367Updated 2 weeks ago
modelscope / MCPBench
The evaluation benchmark on MCP servers
☆220Updated 2 months ago
Alibaba-NLP / ZeroSearch
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
☆1,174Updated 2 months ago
facebookresearch / swe-rl
[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆610Updated 7 months ago
RUCAIBox / Slow_Thinking_with_LLMs
A series of technical report on Slow Thinking with LLM
☆744Updated 2 months ago
LingmaTongyi / Lingma-SWE-GPT
Inference code of Lingma SWE-GPT
☆248Updated 11 months ago
sail-sg / oat-zero
A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.
☆247Updated 6 months ago
FlagAI-Open / OpenSeek
OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next…
☆235Updated last month
SkyworkAI / Skywork-OR1
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
☆727Updated 4 months ago
Agent-RL / ReCall
ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning
☆1,230Updated 5 months ago