NaturalCodeBench (Findings of ACL 2024)
☆68 Oct 14, 2024 Updated last year
Alternatives and similar repositories for NaturalCodeBench
Users who are interested in NaturalCodeBench are comparing it to the repositories listed below.
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation ☆166 Oct 11, 2024 Updated last year
- ☆16 Feb 28, 2024 Updated 2 years ago
- ☆21 Jul 16, 2024 Updated last year
- Official repository for the paper "COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis". ☆18 Feb 19, 2025 Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023) ☆174 Aug 15, 2025 Updated 6 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment ☆21 Apr 2, 2024 Updated last year
- ☆10 Nov 14, 2024 Updated last year
- ☆12 Mar 18, 2024 Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 Oct 11, 2024 Updated last year
- Long Context Research ☆26 Jan 26, 2026 Updated last month
- ☆12 Mar 5, 2025 Updated 11 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆49 Dec 22, 2023 Updated 2 years ago
- ☆21 Jul 24, 2025 Updated 7 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval ☆10 May 30, 2024 Updated last year
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks" ☆56 May 22, 2025 Updated 9 months ago
- ☆16 Nov 26, 2024 Updated last year
- ☆20 Nov 20, 2024 Updated last year
- Repository containing the website for the EMNLP 2023 conference ☆17 Feb 12, 2025 Updated last year
- A modified AlphaZero implementation in C++ where performance matters. ☆18 Updated this week
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024 ☆189 Aug 16, 2024 Updated last year
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories