YerbaPage / MGDebugger
Multi-Granularity LLM Debugger
☆89Updated last month
Alternatives and similar repositories for MGDebugger
Users who are interested in MGDebugger are comparing it to the repositories listed below
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆101Updated 2 months ago
- ☆112Updated 3 months ago
- ☆98Updated 11 months ago
- ☆55Updated 2 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆61Updated last month
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆92Updated 3 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆56Updated 2 months ago
- The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆94Updated last month
- ☆66Updated 4 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆80Updated 2 months ago
- accompanying material for sleep-time compute paper☆105Updated 3 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem”☆30Updated 2 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆92Updated 3 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆126Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 7 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 9 months ago
- Efficient Agent Training for Computer Use☆125Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- ☆59Updated 8 months ago
- ☆89Updated 9 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆68Updated 4 months ago
- ☆159Updated last year
- LLM reads a paper and produces a working prototype☆57Updated 4 months ago
- ☆20Updated last year
- Official code repository for Sketch-of-Thought (SoT)☆125Updated 3 months ago
- Verifiers for LLM Reinforcement Learning☆71Updated 4 months ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆27Updated 4 months ago
- ☆108Updated 2 months ago
- ☆40Updated 8 months ago
- ☆19Updated 5 months ago