autoiac-project / iac-eval
[NeurIPS 24] IaC-Eval: A Code Generation Benchmark for Cloud Infrastructure-as-Code programs
☆30Updated 10 months ago
Alternatives and similar repositories for iac-eval
Users that are interested in iac-eval are comparing it to the libraries listed below
- Zodiac: Unearthing Semantic Checks for Cloud Infrastructure-as-Code Programs, SOSP 2024☆14Updated 11 months ago
- A holistic framework to enable the design, development, and evaluation of autonomous AIOps agents.☆12Updated 5 months ago
- ☁️ Benchmarking LLMs for Cloud Config Generation | Large-model benchmarking for cloud scenarios☆37Updated last year
- Simulator for the datacenter, including power, cooling, server and other components☆16Updated 8 months ago
- [ICML '24] R2E: Turn any GitHub Repository into a Programming Agent Environment☆133Updated 6 months ago
- EvoEval: Evolving Coding Benchmarks via LLM☆79Updated last year
- Cloud incidents/failures related work.☆19Updated 9 months ago
- A series of work towards achieving ACV.☆21Updated last month
- Code repository for scenarios and environment setup as part of ITBench☆12Updated this week
- Predict the performance of LLM inference services☆20Updated last month
- A Framework for Automated Validation of Deep Learning Training Tasks☆52Updated last month
- Course information for CS598 - Topics in LLM Agents (Spring '25), under the direction of Prof. Jiaxuan You (jiaxuan@illinois.edu).☆40Updated 6 months ago
- Serverless LLM Serving for Everyone.☆573Updated last week
- LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24)☆561Updated last year
- [TOSEM'25] The official GitHub page for the survey paper "A Survey on Large Language Models for Code Generation".☆167Updated 3 months ago
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".☆83Updated last year
- This is the repo for remote direct memory introspection.☆22Updated 2 years ago
- How much energy do GenAI models consume?☆47Updated 2 weeks ago
- Push-Button End-to-End Testing of Kubernetes Operators and Controllers☆128Updated 2 months ago
- PyTorch implementation of paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline".☆92Updated 2 years ago
- Must-read papers on Repository-level Code Generation & Issue Resolution 🔥☆195Updated last week
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆25Updated 11 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆218Updated last week
- Burstable Cloud Scheduler☆15Updated last year
- Systems for GenAI☆145Updated 6 months ago
- Easy, Fast, and Scalable Multimodal AI☆20Updated last week
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Updated last year
- [VLDB'2025] LEAP: LLM-powered End-to-end Automatic Library for Processing Social Science Queries on Unstructured Data☆19Updated 8 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆198Updated 5 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆75Updated last year