shouldnotappearcalm / Code-Runner-Sandbox
Code execution sandbox (supports Open-R1) with support for multiple languages (Python/Java/C/Kotlin/Swift/OC/Go/...)
☆20Updated 3 months ago
Alternatives and similar repositories for Code-Runner-Sandbox
Users who are interested in Code-Runner-Sandbox are comparing it to the libraries listed below.
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆58Updated last month
- ☆103Updated 6 months ago
- An Open Math Pre-training Dataset with 370B Tokens.☆89Updated 2 months ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve …☆33Updated 3 weeks ago
- ☆94Updated 6 months ago
- Code LLM data processing for pre-training, fine-tuning, and DPO; an industry SOTA processing pipeline☆42Updated 11 months ago
- ☆63Updated 7 months ago
- ☆53Updated last week
- A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)☆40Updated last month
- ☆56Updated 7 months ago
- ☆273Updated 3 weeks ago
- An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.☆154Updated this week
- ☆53Updated 9 months ago
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'☆27Updated last month
- ☆86Updated last month
- On Memorization of Large Language Models in Logical Reasoning☆67Updated 2 months ago
- The demo, code and data of FollowRAG☆73Updated 2 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆75Updated 3 weeks ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆58Updated 3 weeks ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆133Updated last year
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated last year
- ☆47Updated 2 weeks ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆65Updated last month
- This is an implementation of the paper Improve Mathematical Reasoning in Language Models by Automated Process Supervision from google de…☆33Updated 2 months ago
- ☆142Updated 11 months ago
- ☆95Updated 6 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆57Updated 8 months ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆50Updated 7 months ago
- MCP-Zero: Proactive Toolchain Construction for LLM Agents from Scratch☆49Updated 2 weeks ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆78Updated last month