YerbaPage / MGDebuggerLinks

Multi-Granularity LLM Debugger

☆87

Alternatives and similar repositories for MGDebugger

Users that are interested in MGDebugger are comparing it to the libraries listed below

Sorting:

yueqis / API-Based-Agent
☆54Updated last month
THU-KEG / Agentic-Reward-Modeling
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆99Updated last month
InternLM / SWE-Fixer
☆108Updated 3 months ago
AlexCuadron / ThinkingAgent
Systematic evaluation framework that automatically rates overthinking behavior in large language models.
☆91Updated 2 months ago
du-nlp-lab / MLR-Copilot
☆66Updated 4 months ago
letta-ai / sleep-time-compute
accompanying material for sleep-time compute paper
☆99Updated 3 months ago
zou-group / sirius
SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning
☆61Updated 3 weeks ago
zjunlp / KnowSelf
[ACL 2025] Agentic Knowledgeable Self-awareness
☆77Updated last month
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 6 months ago
kagnlp / CodeGenerator
This repository contains popular code generation frameworks such as MapCoder, CodeSIM.
☆56Updated last month
Intelligent-Internet / ii-thought
II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset
☆26Updated 4 months ago
DeepSoftwareAnalytics / Awesome-Agent4SE
☆96Updated 10 months ago
CogNLP / CogAGENT
☆35Updated 2 years ago
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆88Updated 3 months ago
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
agiresearch / Formal-LLM
Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents
☆125Updated last year
StigLidu / DualDistill
The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"
☆86Updated 2 weeks ago
TIGER-AI-Lab / One-Shot-CFT
The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem”
☆24Updated 2 months ago
neulab / MultiUI
Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding
☆52Updated 7 months ago
metal-chart-generation / metal
☆37Updated 2 months ago
schauppi / Self-Rewarding-Language-Models
☆46Updated last year
miralab-ai / autoreason
☆40Updated 7 months ago
rohinmanvi / Capability-Aware-and-Mid-Generation-Self-Evaluations
☆21Updated last week
xlang-ai / OSWorld-G
Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆96Updated last month
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173Updated 6 months ago
google-deepmind / llms_can_learn_rules
☆59Updated 8 months ago
arcee-ai / DAM
☆53Updated 9 months ago
dvlab-research / MR-GSM8K
Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs
☆50Updated last year
McGill-NLP / weblinx
WebLINX is a benchmark for building web navigation agents with conversational capabilities
☆156Updated 5 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆68Updated 3 months ago