bytedance / SandboxFusionLinks
☆460Updated 3 weeks ago
Alternatives and similar repositories for SandboxFusion
Users that are interested in SandboxFusion are comparing it to the libraries listed below
Sorting:
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving☆203Updated last week
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆228Updated 4 months ago
- Build, evaluate and train General Multi-Agent Assistance with ease☆333Updated last week
- ☆800Updated last month
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆503Updated 3 months ago
- Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"☆94Updated 2 months ago
- slime is a LLM post-training framework aiming for RL Scaling.☆596Updated this week
- A flexible and efficient training framework for large-scale alignment tasks☆388Updated this week
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆537Updated 2 months ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆175Updated 2 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆244Updated 3 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆157Updated last week
- Reproducing R1 for Code with Reliable Rewards☆232Updated 2 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆244Updated 8 months ago
- AN O1 REPLICATION FOR CODING☆335Updated 7 months ago
- A Comprehensive Survey on Long Context Language Modeling☆161Updated last week
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆136Updated this week
- ☆728Updated last month
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆588Updated last month
- ☆266Updated last month
- ☆193Updated 3 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆412Updated last month
- Inference code of Lingma SWE-GPT☆231Updated 7 months ago
- ☆270Updated last month
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆566Updated 4 months ago
- ☆162Updated 3 weeks ago
- A Comprehensive Benchmark for Software Development.☆111Updated last year
- The evaluation benchmark on MCP servers☆150Updated last month
- SkyRL: A Modular Full-stack RL Library for LLMs☆574Updated last week
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆637Updated last month