YerbaPage / MGDebugger
Multi-Granularity LLM Debugger [ICSE2026]
☆93 Updated 4 months ago
Alternatives and similar repositories for MGDebugger
Users who are interested in MGDebugger are comparing it to the repositories listed below.
- ☆125 Updated 6 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models. ☆94 Updated 6 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM. ☆69 Updated 4 months ago
- ☆67 Updated 7 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data ☆177 Updated last week
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆101 Updated 2 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness ☆89 Updated 5 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆112 Updated 5 months ago
- LIMI: Less is More for Agency ☆148 Updated last month
- ☆92 Updated last year
- Data preparation code for CrystalCoder 7B LLM ☆45 Updated last year
- ☆160 Updated last year
- ☆61 Updated 11 months ago
- ☆60 Updated 4 months ago
- ☆102 Updated last year
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling ☆106 Updated 6 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs ☆95 Updated 6 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆173 Updated 10 months ago
- ☆122 Updated 5 months ago
- ☆48 Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆128 Updated last year
- [EMNLP'2025 Industry] Repo for "Z1: Efficient Test-time Scaling with Code" ☆66 Updated 7 months ago
- FuseAI Project ☆87 Updated 9 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper ☆44 Updated 3 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25] ☆33 Updated 2 months ago
- accompanying material for sleep-time compute paper ☆117 Updated 6 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning ☆131 Updated 2 months ago
- ☆18 Updated 7 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents ☆33 Updated last month
- A repository for research on medium sized language models. ☆78 Updated last year