code-rag-bench / code-rag-bench
CodeRAG-Bench: Can Retrieval Augment Code Generation?
☆99Updated 2 months ago
Alternatives and similar repositories for code-rag-bench:
Users that are interested in code-rag-bench are comparing it to the libraries listed below
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆199Updated this week
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆126Updated 5 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph☆122Updated last week
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆140Updated 5 months ago
- A Comprehensive Benchmark for Software Development.☆88Updated 7 months ago
- ☆205Updated 5 months ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆54Updated 3 months ago
- Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%)☆71Updated 2 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆103Updated 2 months ago
- NaturalCodeBench (Findings of ACL 2024)☆61Updated 3 months ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆119Updated 3 months ago
- Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024)☆50Updated 6 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆107Updated 2 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆148Updated 10 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆171Updated 3 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆134Updated last month
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆115Updated 4 months ago
- ☆152Updated 4 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆208Updated 3 months ago
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models☆85Updated 5 months ago
- Open Source WizardCoder Dataset☆155Updated last year
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models☆84Updated 9 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆132Updated last month
- ☆159Updated this week
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆126Updated 2 months ago
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".☆231Updated 2 months ago
- Source code for paper: INTERVENOR : Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing☆26Updated last month
- ☆41Updated 7 months ago
- evol augment any dataset online☆56Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆97Updated 4 months ago