YerbaPage / MGDebugger
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging
☆71Updated last month
Alternatives and similar repositories for MGDebugger:
Users that are interested in MGDebugger are comparing it to the libraries listed below
- ☆85Updated 2 months ago
- LLM reads a paper and produce a working prototype☆52Updated 2 weeks ago
- Agentic Knowledgeable Self-awareness☆50Updated last week
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆86Updated last month
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- ☆62Updated 3 weeks ago
- ☆42Updated 7 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆86Updated 2 weeks ago
- ☆24Updated 7 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆43Updated last week
- ☆56Updated 4 months ago
- ☆84Updated last week
- A repository for research on medium sized language models.☆76Updated 11 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆122Updated 10 months ago
- ☆44Updated 10 months ago
- accompany material for sleep time compute paper☆17Updated last week
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆52Updated 3 weeks ago
- ☆33Updated 10 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆100Updated last month
- Tina: Tiny Reasoning Models via LoRA☆55Updated this week
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs☆45Updated 9 months ago
- ☆73Updated last year
- ☆61Updated 7 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 6 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆171Updated 3 months ago
- ☆20Updated 4 months ago
- ☆20Updated 10 months ago
- ☆24Updated last month
- The first dense retrieval model that can be prompted like an LM☆71Updated 7 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆50Updated 4 months ago