NL2Code / CodeRLinks
☆157Updated 9 months ago
Alternatives and similar repositories for CodeR
Users that are interested in CodeR are comparing it to the libraries listed below
Sorting:
- ☆120Updated 9 months ago
- ☆92Updated 3 weeks ago
- ☆93Updated 10 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆224Updated 4 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆108Updated 7 months ago
- ☆416Updated this week
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆175Updated this week
- ☆51Updated 10 months ago
- Scaling Data for SWE-agents☆212Updated this week
- Enhancing AI Software Engineering with Repository-level Code Graph☆178Updated 2 months ago
- ☆83Updated last month
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆97Updated last year
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆183Updated 2 months ago
- Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%)☆84Updated 2 months ago
- ☆20Updated last month
- Open Source WizardCoder Dataset☆158Updated last year
- Harness used to benchmark aider against SWE Bench benchmarks☆72Updated 11 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀☆120Updated 3 months ago
- AWM: Agent Workflow Memory☆271Updated 4 months ago
- CodeRAG-Bench: Can Retrieval Augment Code Generation?☆135Updated 6 months ago
- A Comprehensive Benchmark for Software Development.☆106Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆122Updated 11 months ago
- ☆121Updated 11 months ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆164Updated 9 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆69Updated 8 months ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆142Updated 10 months ago
- MapCoder: Multi-Agent Code Generation for Competitive Problem Solving☆145Updated 3 months ago
- ☆46Updated last year
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆307Updated 3 months ago
- ☆47Updated 5 months ago