NEUIR / INTERVENORLinks
Source code for paper: INTERVENOR : Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing
☆26Updated 10 months ago
Alternatives and similar repositories for INTERVENOR
Users that are interested in INTERVENOR are comparing it to the libraries listed below
Sorting:
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository☆66Updated last year
- ☆66Updated 10 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆59Updated last year
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆71Updated last year
- ☆53Updated last year
- Training and Benchmarking LLMs for Code Preference.☆36Updated 11 months ago
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆47Updated 9 months ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆62Updated last year
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio…☆108Updated last week
- Run SWE-bench evaluations remotely☆41Updated 2 months ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks☆58Updated last year
- ☆101Updated last year
- ☆115Updated 4 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph☆216Updated 6 months ago
- Codes and Data for ACL 2024 Paper "Faithful Logical Reasoning via Symbolic Chain-of-Thought".☆193Updated last year
- Large Language Models Meet NL2Code: A Survey☆35Updated 11 months ago
- ☆160Updated last year
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆163Updated last year
- ☆35Updated 11 months ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆172Updated last year
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆131Updated last year
- ☆41Updated 3 months ago
- PGRAG☆51Updated last year
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.☆56Updated 11 months ago
- Harness used to benchmark aider against SWE Bench benchmarks☆75Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆49Updated last year
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆68Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆64Updated 2 years ago
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generaton☆198Updated last year
- ☆30Updated 4 months ago