shouldnotappearcalm / Code-Runner-Sandbox
Code execution sandbox (supports Open-R1) with support for multiple languages (Python/Java/C/Kotlin/Swift/OC/Go/...)
☆16Updated last month
Alternatives and similar repositories for Code-Runner-Sandbox:
Users that are interested in Code-Runner-Sandbox are comparing it to the libraries listed below
- ☆101Updated 4 months ago
- ☆51Updated 7 months ago
- Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 10 months ago
- Knowledge-Reasoning Synergy Reinforcement Learning.☆34Updated last month
- ☆63Updated 5 months ago
- ☆31Updated 5 months ago
- An Open Math Pre-training Dataset with 370B Tokens.☆72Updated 3 weeks ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆70Updated this week
- A repo showcasing the use of MCTS with LLMs to solve GSM8K problems☆74Updated last month
- ☆153Updated last month
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated 10 months ago
- ☆47Updated 4 months ago
- ☆41Updated last week
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆55Updated 4 months ago
- ☆94Updated 4 months ago
- Delta-CoMe can achieve near-lossless 1-bit compression; accepted at NeurIPS 2024☆57Updated 5 months ago
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆237Updated last week
- MPO: Boosting LLM Agents with Meta Plan Optimization☆50Updated last month
- ☆146Updated last month
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve …☆32Updated 4 months ago
- A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)☆30Updated last month
- ☆16Updated 2 weeks ago
- On Memorization of Large Language Models in Logical Reasoning☆65Updated 3 weeks ago
- Reformatted Alignment☆115Updated 7 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆43Updated 4 months ago
- ☆143Updated 9 months ago
- The demo, code and data of FollowRAG☆72Updated this week
- ☆39Updated 11 months ago
- Awesome Agent Training☆72Updated this week
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆148Updated 7 months ago