R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning
☆29 · Feb 9, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for R1-Code-Interpreter
Users who are interested in R1-Code-Interpreter are comparing it to the libraries listed below
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆15 · Feb 9, 2026 · Updated 3 weeks ago
- 🧌 Live2d models for cnblog themes. ☆11 · Apr 3, 2022 · Updated 3 years ago
- This repository is our group's final project for the Computer Game Development course (Shenzhen University): a card game modeled on Slay the Spire. ☆10 · Jun 28, 2019 · Updated 6 years ago
- 💀 gigasmol: a lightweight wrapper for gigachat api model for seamless use with smolagents. ☆15 · Oct 23, 2025 · Updated 4 months ago
- ☆25 · Aug 19, 2025 · Updated 6 months ago
- A library of fast and accurate low fidelity dynamic models for applications in robotics ☆11 · Jul 12, 2024 · Updated last year
- ☆11 · Jun 11, 2025 · Updated 8 months ago
- This repo has scripts to compare various powerful RL methods ☆33 · Feb 23, 2026 · Updated last week
- Large-scale text embedding model ☆38 · Sep 6, 2025 · Updated 5 months ago
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large … ☆13 · Apr 1, 2025 · Updated 11 months ago
- A very early-stage attempt to use OOP concepts to help interact with Excel formulas ☆10 · May 8, 2024 · Updated last year
- Modern zip & unzip replacements ☆16 · Aug 23, 2025 · Updated 6 months ago
- ☆31 · Sep 19, 2025 · Updated 5 months ago
- Official implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation". ☆20 · Oct 29, 2025 · Updated 4 months ago
- Work on the Cucker-Smale model and its extensions. Keywords: ODE, Runge-Kutta methods, SDE, Euler-Maruyama method, NumPy, Matplotlib ☆11 · Feb 14, 2024 · Updated 2 years ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection ☆25 · May 31, 2025 · Updated 9 months ago
- ☆11 · Jan 3, 2024 · Updated 2 years ago
- Some Pwn Challenges from winesap. ☆14 · Aug 15, 2019 · Updated 6 years ago
- 🔦 A minimal raytracing engine written in C on MinilibX ☆10 · Mar 23, 2021 · Updated 4 years ago
- The GPT-4 function calls used in everchanging quest for the HF game jam ☆10 · Jul 9, 2023 · Updated 2 years ago
- ☆13 · Oct 28, 2024 · Updated last year
- biorbd + casadi + variational integrator ☆10 · Apr 30, 2024 · Updated last year
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents ☆23 · Feb 21, 2026 · Updated last week
- ☆32 · Feb 13, 2026 · Updated 2 weeks ago
- Wave - The Software as a Service Starter Kit, designed to help you build the SAAS of your dreams 🚀 💰 ☆12 · Jan 30, 2026 · Updated last month
- ☆15 · Mar 12, 2024 · Updated last year
- Risky Object Localization (ROL) in a Driving Scene Dataset ☆15 · Dec 24, 2023 · Updated 2 years ago
- ☆14 · Jul 24, 2023 · Updated 2 years ago
- CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings ☆65 · Feb 3, 2025 · Updated last year
- ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding ☆17 · Aug 8, 2025 · Updated 6 months ago
- ☆12 · Sep 8, 2024 · Updated last year
- ☆12 · Jul 16, 2024 · Updated last year
- Efficiently creating diverse multi-turn Text-to-SQL training samples in just 3 steps! 🚀 ☆14 · Jan 31, 2026 · Updated last month
- ☆14 · Apr 16, 2024 · Updated last year
- MPI Code Generation through Domain-Specific Language Models ☆14 · Nov 19, 2024 · Updated last year
- A collection of ChatGPT prompts for cross-border e-commerce ☆12 · Oct 14, 2025 · Updated 4 months ago
- Competitive Programming Code Template ☆11 · Nov 6, 2022 · Updated 3 years ago
- The official implementation of dLLM-Var ☆31 · Nov 6, 2025 · Updated 3 months ago
- ☆12 · Nov 30, 2022 · Updated 3 years ago