A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories
☆38 · Sep 4, 2024 · Updated last year
Alternatives and similar repositories for DevEval
Users who are interested in DevEval are comparing it to the repositories listed below.
- ☆28 · Jul 20, 2024 · Updated last year
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories ☆70 · Aug 15, 2024 · Updated last year
- A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI. ☆156 · Dec 25, 2024 · Updated last year
- Large Language Models for Software Engineering: A Systematic Literature Review ☆106 · Dec 8, 2025 · Updated 4 months ago
- Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288 ☆20 · Oct 30, 2024 · Updated last year
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings) ☆15 · May 14, 2023 · Updated 2 years ago
- ☆14 · May 28, 2024 · Updated last year
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆106 · Mar 6, 2025 · Updated last year
- A Comprehensive Benchmark for Software Development. ☆132 · May 30, 2024 · Updated last year
- ☆32 · Jan 14, 2025 · Updated last year
- The Infibench variant of bigcode-evaluation-harness --- a framework for the evaluation of autoregressive code generation language models. ☆14 · Oct 19, 2024 · Updated last year
- ☆38 · Apr 8, 2026 · Updated 3 weeks ago
- [EMNLP 2023] CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation ☆59 · Nov 16, 2023 · Updated 2 years ago
- ☆14 · Dec 12, 2023 · Updated 2 years ago
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension ☆12 · Oct 23, 2022 · Updated 3 years ago
- Official repository of the paper: Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code (Findings of EACL … ☆12 · Mar 26, 2026 · Updated last month
- ☆28 · Nov 10, 2025 · Updated 5 months ago
- Include ML DL RL, knowledge and code ☆12 · Feb 12, 2023 · Updated 3 years ago
- Baselines for all tasks from Long Code Arena benchmarks 🏟️ ☆39 · Mar 30, 2025 · Updated last year
- Code and data for AAAI 2022 paper "Multilingual Code Snippets Training for Program Translation" ☆10 · Mar 7, 2022 · Updated 4 years ago
- The dataset, source code and the results of our ESEC/FSE 2023 paper "An Extensive Study on Adversarial Attack against Pre-trained Models … ☆15 · Feb 19, 2024 · Updated 2 years ago
- GAOKAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models. ☆42 · Jan 7, 2025 · Updated last year
- ☆18 · Mar 18, 2024 · Updated 2 years ago
- SALNet: Semi-supervised Few-Shot Text Classification with Attention-based Lexicon Construction ☆25 · Jun 18, 2022 · Updated 3 years ago
- A collection of some awesome public projects about LLM-based Web Agents and Tools. ☆12 · Apr 25, 2024 · Updated 2 years ago
- ☆26 · Oct 24, 2025 · Updated 6 months ago
- Benchmark ClassEval for class-level code generation. ☆146 · Oct 24, 2024 · Updated last year
- A benchmark for logging statement generation. ☆27 · Nov 3, 2024 · Updated last year
- Code for ICML2020 "Sequence Generation with Mixed Representations" ☆12 · Jun 27, 2020 · Updated 5 years ago
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation". ☆270 · Oct 30, 2024 · Updated last year
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie… ☆49 · Apr 25, 2022 · Updated 4 years ago
- This repo contains the dataset for paper: Creating a Dataset Supporting Translation Between OpenMP Fortran and C++ Code ☆15 · Dec 1, 2023 · Updated 2 years ago
- [EMNLP 2024] FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents ☆22 · Jan 6, 2025 · Updated last year
- Convert pretrained RoBerta models to various long-document transformer models ☆11 · Apr 5, 2022 · Updated 4 years ago
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning ☆15 · May 13, 2025 · Updated 11 months ago
- Building RESTful API with Laravel [Video], published by Packt ☆12 · Jan 15, 2021 · Updated 5 years ago
- ☆14 · Feb 18, 2025 · Updated last year
- ☆10 · Jul 11, 2022 · Updated 3 years ago
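- A feature-rich online text-cleaning tool for formatting text copied from PDF, PPT, CAJ, and similar files, removing extra spaces and line breaks ☆19 · Jan 23, 2023 · Updated 3 years ago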