[DL4C @ ICLR 2025] A Benchmark for Automated Environment Setup
☆35 Nov 9, 2025 Updated 4 months ago
Alternatives and similar repositories for EnvBench
Users that are interested in EnvBench are comparing it to the libraries listed below.
- Building Training Datasets for Deep Learning Models in Software Engineering and Empirical Software Engineering Research ☆26 Jun 26, 2024 Updated last year
- ⚙️ A tool for collecting executable code datasets with GitHub Actions ⚙️ ☆23 Mar 12, 2026 Updated last week
- Java bindings for tree-sitter ☆59 Feb 19, 2026 Updated last month
- [ACL'25 Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline. ☆59 Jul 21, 2025 Updated 8 months ago
- This repo contains all the code for the SEScore implementation ☆15 Mar 3, 2025 Updated last year
- [ESEC/FSE'23] Hue: A User-Adaptive Parser for Hybrid Logs ☆10 Aug 24, 2023 Updated 2 years ago
- ☆11 Jan 19, 2025 Updated last year
- ☆15 Jan 7, 2023 Updated 3 years ago
- ☆25 Oct 2, 2024 Updated last year
- ☆13 Aug 12, 2022 Updated 3 years ago
- ☆17 Mar 3, 2025 Updated last year
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation ☆74 Mar 13, 2026 Updated last week
- ☆20 Nov 6, 2019 Updated 6 years ago
- ☆15 Jul 20, 2025 Updated 8 months ago
- ☆13 Sep 26, 2024 Updated last year
- Implementation of "Visualize Before You Write: Imagination-Guided Open-Ended Text Generation". ☆17 Feb 3, 2023 Updated 3 years ago
- IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents (NeurIPS 2024) ☆16 Jul 14, 2025 Updated 8 months ago
- ☆14 May 20, 2022 Updated 3 years ago
- Introduction to dataflow analysis using Julia ☆14 Oct 26, 2020 Updated 5 years ago
- ☆17 Feb 18, 2026 Updated last month
- [ICSE'25] Aligning the Objective of LLM-based Program Repair ☆23 Mar 8, 2025 Updated last year
- Automated black-box REST API testing using graph-based modeling, LLMs, and multi-agent reinforcement learning. ☆45 Feb 20, 2026 Updated last month
- It looks like a crying face, poor thing... ☆10 Feb 16, 2023 Updated 3 years ago
- [ICLR 2025] "GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation", Tao Feng, Yihang Sun, Jiaxuan You ☆18 Mar 18, 2025 Updated last year
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning ☆15 May 13, 2025 Updated 10 months ago
- ☆38 Jan 24, 2022 Updated 4 years ago
- A feature-rich online text cleanup tool for formatting text copied from PDF, PPT, CAJ, and similar sources, removing extra spaces and line breaks ☆19 Jan 23, 2023 Updated 3 years ago
- ☆38 Oct 28, 2025 Updated 4 months ago
- hikalium's lifestyle guide ☆12 Feb 16, 2025 Updated last year
- Proof-carrying code completions in Dafny ☆11 Apr 4, 2025 Updated 11 months ago
- ☆12 Jul 8, 2021 Updated 4 years ago
- An ANTLR4 grammar for ECMAScript 5.1 ☆16 Jul 13, 2017 Updated 8 years ago
- A toolkit for hybrid log parsing ☆18 Aug 23, 2023 Updated 2 years ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning ☆39 Jan 16, 2026 Updated 2 months ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision ☆19 Apr 1, 2025 Updated 11 months ago
- A generational genetic algorithm approach to Java Virtual Machine settings optimization for a variety of servers. ☆21 Aug 29, 2014 Updated 11 years ago
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live! ☆171 Mar 9, 2026 Updated 2 weeks ago
- [TMLR 2023] V1T: Large-scale mouse V1 response prediction using a Vision Transformer ☆23 Oct 17, 2025 Updated 5 months ago
- Official code for the paper "Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation" ☆21 Feb 29, 2024 Updated 2 years ago