NEUIR / INTERVENORLinks
Source code for paper: INTERVENOR : Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing
☆28Updated last year
Alternatives and similar repositories for INTERVENOR
Users that are interested in INTERVENOR are comparing it to the libraries listed below
Sorting:
- ☆68Updated last year
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository☆69Updated last year
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆167Updated last year
- RepoQA: Evaluating Long-Context Code Understanding☆125Updated last year
- ☆54Updated last year
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆73Updated last year
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆60Updated last year
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆48Updated last month
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆64Updated last year
- Training and Benchmarking LLMs for Code Preference.☆37Updated last year
- ☆159Updated last year
- Large Language Models Meet NL2Code: A Survey☆35Updated last year
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆79Updated last year
- Advancing LLM with Diverse Coding Capabilities☆80Updated last year
- Run SWE-bench evaluations remotely☆46Updated 4 months ago
- Code for Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks (WWW 2024))☆58Updated last month
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆63Updated last year
- Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%)☆95Updated 8 months ago
- ☆40Updated 7 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph☆237Updated 8 months ago
- ☆128Updated 6 months ago
- Harness used to benchmark aider against SWE Bench benchmarks☆78Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 11 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Updated last year
- ☆102Updated last year
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 8 months ago
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio…☆112Updated 2 months ago
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.☆61Updated last year
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆135Updated last year