bytedance / SandboxFusion
☆586Updated 2 months ago
Alternatives and similar repositories for SandboxFusion
Users that are interested in SandboxFusion are comparing it to the libraries listed below
- ☆815Updated 3 months ago
- Build, evaluate and train General Multi-Agent Assistance with ease☆742Updated this week
- A flexible and efficient training framework for large-scale alignment tasks☆425Updated this week
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆548Updated 4 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆341Updated this week
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆689Updated last month
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆253Updated 7 months ago
- ☆803Updated 3 weeks ago
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving☆249Updated last week
- ☆742Updated 2 weeks ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆596Updated 5 months ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆201Updated 4 months ago
- Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"☆103Updated 4 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆633Updated last month
- Agentic Foundation Platform☆471Updated this week
- An Awesome List of Agentic Models trained with Reinforcement Learning☆469Updated last week
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆348Updated 3 weeks ago
- ☆336Updated 3 months ago
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning☆1,189Updated 4 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆799Updated 2 months ago
- slime is an LLM post-training framework for RL Scaling.☆1,827Updated this week
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.☆244Updated 5 months ago
- Distributed RL System for LLM Reasoning☆2,614Updated this week
- A series of technical reports on Slow Thinking with LLM☆734Updated last month
- SkyRL: A Modular Full-stack RL Library for LLMs☆862Updated last week
- a-m-team's exploration in large language modeling☆188Updated 3 months ago
- ☆201Updated 5 months ago
- ☆1,083Updated this week
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆713Updated 3 months ago
- ☆393Updated 3 weeks ago