bytedance / SandboxFusion
☆561Updated 2 months ago
Alternatives and similar repositories for SandboxFusion
Users that are interested in SandboxFusion are comparing it to the libraries listed below
- Build, evaluate and train General Multi-Agent Assistance with ease☆619Updated this week
- ☆812Updated 2 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆298Updated this week
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆625Updated last month
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving☆239Updated this week
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆546Updated 3 months ago
- A flexible and efficient training framework for large-scale alignment tasks☆415Updated last week
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆187Updated 3 months ago
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆246Updated 6 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆568Updated 4 months ago
- An Awesome List of Agentic Models trained with Reinforcement Learning☆420Updated this week
- Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"☆100Updated 3 months ago
- ☆739Updated this week
- ☆791Updated 2 months ago
- ☆318Updated 2 months ago
- AN O1 REPLICATION FOR CODING☆335Updated 8 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆625Updated 3 weeks ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆767Updated last month
- slime is an LLM post-training framework aiming for RL Scaling.☆1,496Updated this week
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.☆245Updated 4 months ago
- A series of technical reports on Slow Thinking with LLM☆726Updated 2 weeks ago
- ☆197Updated last week
- a-m-team's exploration in large language modeling☆186Updated 3 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆765Updated this week
- ☆361Updated 2 weeks ago
- ☆198Updated 4 months ago
- ☆198Updated 2 weeks ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆327Updated this week
- A Comprehensive Survey on Long Context Language Modeling☆180Updated last month
- OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next…☆222Updated this week