NL2Code / CodeR
☆153Updated 5 months ago
Alternatives and similar repositories for CodeR:
Users that are interested in CodeR are comparing it to the libraries listed below
- Enhancing AI Software Engineering with Repository-level Code Graph☆132Updated last month
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆206Updated last month
- ☆349Updated 2 weeks ago
- AWM: Agent Workflow Memory☆241Updated 2 weeks ago
- Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%)☆71Updated 3 months ago
- ☆83Updated 7 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆144Updated 2 weeks ago
- CodeRAG-Bench: Can Retrieval Augment Code Generation?☆109Updated 3 months ago
- ☆114Updated 6 months ago
- ☆63Updated last month
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆97Updated last year
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆172Updated 4 months ago
- 🦀️ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆213Updated 2 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆116Updated 8 months ago
- A Comprehensive Benchmark for Software Development.☆93Updated 8 months ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆98Updated 2 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆125Updated 2 months ago
- ☆156Updated 6 months ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆144Updated 6 months ago
- ☆17Updated last month
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆336Updated 8 months ago
- Harness used to benchmark aider against SWE Bench benchmarks☆66Updated 7 months ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆320Updated 2 weeks ago
- MapCoder: Multi-Agent Code Generation for Competitive Problem Solving☆111Updated last week
- ☆120Updated 8 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆100Updated 5 months ago
- ☆51Updated 6 months ago
- Implementation of Google's SELF-DISCOVER☆289Updated 6 months ago
- An implemtation of Everyting of Thoughts (XoT).☆139Updated 11 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆192Updated 2 weeks ago