YerbaPage / MGDebuggerLinks
Multi-Granularity LLM Debugger [ICSE2026]
☆91Updated 3 months ago
Alternatives and similar repositories for MGDebugger
Users that are interested in MGDebugger are comparing it to the libraries listed below
Sorting:
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆64Updated 4 months ago
- ☆121Updated 5 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆93Updated 5 months ago
- ☆67Updated 6 months ago
- accompanying material for sleep-time compute paper☆117Updated 5 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆174Updated 2 weeks ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated last month
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆108Updated 4 months ago
- ☆58Updated 4 months ago
- ☆22Updated 3 months ago
- LIMI: Less is More for Agency☆141Updated 2 weeks ago
- ☆101Updated last year
- [ACL 2025] Agentic Knowledgeable Self-awareness☆87Updated 4 months ago
- ☆120Updated 4 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 9 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 11 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆94Updated 5 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆29Updated 6 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 6 months ago
- ☆119Updated last year
- ☆92Updated 11 months ago
- ☆160Updated last year
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- [EMNLP'2025 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆65Updated 6 months ago
- Efficient Agent Training for Computer Use☆131Updated last month
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆106Updated 5 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆32Updated last month
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆71Updated 3 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆127Updated last year
- Verifiers for LLM Reinforcement Learning☆77Updated 6 months ago