LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24)
☆587Sep 10, 2024Updated last year
Alternatives and similar repositories for LLMDebugger
Users that are interested in LLMDebugger are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AgentCoder: multi-agent code generation framework.☆388Nov 18, 2025Updated 7 months ago
- MapCoder: Multi-Agent Code Generation for Competitive Problem Solving☆193Feb 12, 2025Updated last year
- Multi-Granularity LLM Debugger [ICSE2026]☆98Jul 6, 2025Updated 11 months ago
- The official repo for the paper Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code Gen…☆20Feb 27, 2024Updated 2 years ago
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆840Jul 30, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The repository for paper "DebugBench: "Evaluating Debugging Capability of Large Language Models".☆86Jul 13, 2024Updated last year
- This repo is for our submission for ICSE 2025.☆20Jun 12, 2024Updated 2 years ago
- [NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning☆3,180Jan 14, 2025Updated last year
- ☆641Sep 1, 2025Updated 9 months ago
- ☆159Aug 27, 2024Updated last year
- Language Models for Code Completion: a Practical Evaluation☆13Jan 19, 2024Updated 2 years ago
- ☆28Jun 2, 2026Updated 2 weeks ago
- TeCo: an ML+Execution model for test completion☆31Jun 16, 2024Updated 2 years ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆699Mar 16, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.☆64Oct 21, 2024Updated last year
- Agentless🐱: an agentless approach to automatically solve software development problems☆2,068Dec 22, 2024Updated last year
- [TOSEM 2026]A Systematic Literature Review on Large Language Models for Automated Program Repair☆243May 1, 2026Updated last month
- [ACL 2024] Source code for InBedder, an instruction-following text embedder☆31Oct 11, 2024Updated last year
- SWE-bench: Can Language Models Resolve Real-world Github Issues?☆5,175Apr 1, 2026Updated 2 months ago
- Violet: Selective Symbolic Execution to Detect Bad Performance Misconfiguration☆18Oct 16, 2020Updated 5 years ago
- ☆34Oct 2, 2024Updated last year
- Enhancing AI Software Engineering with Repository-level Code Graph☆282Apr 1, 2025Updated last year
- Official code release for the paper Coder Reviewer Reranking for Code Generation.☆45Feb 14, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆26Sep 23, 2024Updated last year
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…☆3,088Apr 24, 2025Updated last year
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆322Feb 24, 2025Updated last year
- ☆676Nov 1, 2024Updated last year
- ☆118Jul 17, 2024Updated last year
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,726May 7, 2024Updated 2 years ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆884Jul 16, 2025Updated 11 months ago
- Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey". Keep updating.☆546Mar 16, 2025Updated last year
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024☆1,764Oct 2, 2025Updated 8 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- piggybacking on the Dafny language implementation to explore interactive semi-automated verified program synthesis, combining LLMs and sy…☆17Mar 26, 2026Updated 2 months ago
- Can Language Models Solve Olympiad Programming?☆124Jan 14, 2025Updated last year
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,943Nov 25, 2024Updated last year
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆208Aug 16, 2024Updated last year
- [TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.☆3,382May 20, 2026Updated 3 weeks ago
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec…☆19,496Jun 10, 2026Updated last week
- Simple example of autonomous research ran in parallel from my Aetherius Ai Assistant project. Uses Openai's GPT-3.5, GPT-4, and Microsof…☆15May 11, 2023Updated 3 years ago