xlang-ai / EVOR
☆56Updated 2 months ago
Alternatives and similar repositories for EVOR:
Users that are interested in EVOR are comparing it to the libraries listed below
- Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024)☆51Updated 8 months ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆131Updated 4 months ago
- NaturalCodeBench (Findings of ACL 2024)☆62Updated 4 months ago
- Large Language Models Meet NL2Code: A Survey☆36Updated 3 months ago
- ☆44Updated 9 months ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆63Updated 6 months ago
- Advancing LLM with Diverse Coding Capabilities☆62Updated 7 months ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆58Updated 5 months ago
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.☆48Updated 4 months ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆131Updated 7 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆64Updated 6 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆150Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated 10 months ago
- ☆28Updated 3 months ago
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆79Updated 11 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- ☆22Updated 5 months ago
- CodeRAG-Bench: Can Retrieval Augment Code Generation?☆115Updated 3 months ago
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆43Updated last month
- evol augment any dataset online☆58Updated last year
- ☆121Updated last year
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆50Updated 9 months ago
- Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“☆62Updated 2 years ago
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"☆109Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆78Updated 6 months ago
- Training and Benchmarking LLMs for Code Preference.☆33Updated 3 months ago
- Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)☆83Updated last year
- RepoQA: Evaluating Long-Context Code Understanding☆103Updated 4 months ago
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>☆48Updated last year
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)☆86Updated last year