YerbaPage / SWE-ExpLinks
SWE-Exp: Experience-Driven Software Issue Resolution
☆35Updated 3 months ago
Alternatives and similar repositories for SWE-Exp
Users that are interested in SWE-Exp are comparing it to the libraries listed below
Sorting:
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution☆24Updated 3 months ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆104Updated 4 months ago
- Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%)☆95Updated 10 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆69Updated 9 months ago
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!☆161Updated 2 weeks ago
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"☆101Updated 4 months ago
- The evaluation benchmark on MCP servers☆240Updated 5 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆50Updated last month
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆118Updated 4 months ago
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆75Updated 3 months ago
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆58Updated 6 months ago
- ☆87Updated 5 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆306Updated 4 months ago
- ☆93Updated 8 months ago
- The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution☆219Updated this week
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆61Updated last year
- Data Synthesis for Deep Research Based on Semi-Structured Data☆198Updated last month
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆64Updated last year
- ☆56Updated last year
- ☆131Updated 9 months ago
- ☆46Updated 3 months ago
- ☆133Updated last month
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆149Updated 4 months ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆74Updated last year
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent☆169Updated 2 months ago
- ☆70Updated 4 months ago
- [ICLR 2026] Efficient Agent Training for Computer Use☆137Updated 5 months ago
- CodeRAG-Bench: Can Retrieval Augment Code Generation?☆167Updated last year
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning".☆38Updated 5 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆62Updated 7 months ago