DeepSoftwareAnalytics / Awesome-Agent4SELinks
☆105Updated last year
Alternatives and similar repositories for Awesome-Agent4SE
Users that are interested in Awesome-Agent4SE are comparing it to the libraries listed below
Sorting:
- Beating the GAIA benchmark with Transformers Agents. 🚀☆144Updated 10 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆189Updated 4 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆126Updated 11 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆119Updated 7 months ago
- ☆61Updated 6 months ago
- ☆128Updated 7 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆80Updated last year
- Multi-Granularity LLM Debugger [ICSE2026]☆94Updated 6 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆95Updated 7 months ago
- ☆159Updated last year
- CodeSage: Code Representation Learning At Scale (ICLR 2024)☆115Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆132Updated last year
- ☆130Updated 8 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph☆242Updated 9 months ago
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆95Updated 2 months ago
- ☆84Updated last year
- ☆86Updated last year
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆241Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆120Updated 2 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆91Updated 6 months ago
- ☆213Updated 2 weeks ago
- Run SWE-bench evaluations remotely☆50Updated 4 months ago
- AWM: Agent Workflow Memory☆376Updated 3 weeks ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆102Updated 5 months ago
- Official Repo for CRMArena and CRMArena-Pro☆127Updated 2 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆191Updated 3 weeks ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆68Updated 6 months ago
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performance☆92Updated last year
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆60Updated 10 months ago
- ☆63Updated last year