[NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents
☆85Apr 27, 2026Updated last month
Alternatives and similar repositories for gso
Users that are interested in gso are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML '24] R2E: Turn any GitHub Repository into a Programming Agent Environment☆148Apr 20, 2025Updated last year
- Heavyweight Python dynamic analysis framework☆18Apr 17, 2024Updated 2 years ago
- ☆14Apr 24, 2024Updated 2 years ago
- moodist☆28Apr 23, 2026Updated last month
- This repository contains the Julia code for the paper "Competitive Gradient Descent"☆25Dec 18, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13Mar 5, 2025Updated last year
- ☆28Mar 10, 2026Updated 3 months ago
- ☆33Oct 2, 2024Updated last year
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆690Jul 29, 2025Updated 10 months ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆290Jul 13, 2025Updated 11 months ago
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena…☆42Feb 18, 2026Updated 3 months ago
- Concise tutorials for distributed training using PyTorch☆10Apr 18, 2023Updated 3 years ago
- The repository contains code for Adaptive Data Optimization☆36Dec 9, 2024Updated last year
- ☆12Dec 17, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Wasabi is a toolkit designed to isolate and trigger retry bugs by combining static program analysis, large language models (LLMs), fault …☆10Oct 8, 2024Updated last year
- 将 有道单词本/不背单词/轻听英语 同步到 墨墨背单词☆13Aug 28, 2020Updated 5 years ago
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI☆507Jan 3, 2026Updated 5 months ago
- Collect simple coverage information in memory.☆11Oct 6, 2022Updated 3 years ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆699Mar 16, 2025Updated last year
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆672Jun 8, 2026Updated last week
- ☆58Jul 18, 2024Updated last year
- SimCommand is a library for writing high-performance RTL testbenches with simulation threads in Scala using chiseltest.☆15Aug 30, 2023Updated 2 years ago
- A toy symbolic execution engine, supporting the blog article ...☆18Sep 8, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [IPDPS 2024] Adaptive neighbor sampling for temporal GNN☆16Feb 17, 2025Updated last year
- Automated High-Performance GPU Kernel Generation☆114Jun 1, 2026Updated 2 weeks ago
- ☆140Oct 16, 2025Updated 7 months ago
- A practical fuzzing tool for SMT solvers☆11Nov 26, 2025Updated 6 months ago
- Microsoft question-answering dataset☆10Jun 16, 2023Updated 3 years ago
- Voila! A smart automatic pet feeder using Arduino Uno + RTC time module for scheduling + multiple sensors.☆10Jun 4, 2024Updated 2 years ago
- ☆17Mar 20, 2025Updated last year
- ☆22Sep 28, 2022Updated 3 years ago
- Factor Graph Grammars in Python☆13Jan 17, 2026Updated 4 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 7 months ago
- Graph Sampling using GPU☆52Mar 17, 2022Updated 4 years ago
- ☆18Feb 20, 2026Updated 3 months ago
- Scripts for fine-tuning an HPC Code LLM☆17Jul 19, 2024Updated last year
- Fast and Precise On-the-fly Patch Validation for All☆10Feb 24, 2023Updated 3 years ago
- [ICML 2025] Adaptive Self-improvement LLM Agentic System for ML Library Development☆17Jan 6, 2026Updated 5 months ago
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated 2 years ago