DeepSoftwareAnalytics / Awesome-Agent4SELinks
β106Updated last year
Alternatives and similar repositories for Awesome-Agent4SE
Users that are interested in Awesome-Agent4SE are comparing it to the libraries listed below
Sorting:
- Beating the GAIA benchmark with Transformers Agents. πβ146Updated 11 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)β193Updated 2 weeks ago
- β159Updated last year
- β61Updated 7 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agentsβ132Updated last year
- CodeSage: Code Representation Learning At Scale (ICLR 2024)β116Updated last year
- β132Updated 8 months ago
- Official Code for Oα΄α΄Ι΄-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models (EMNLP Findings 2024)β141Updated 11 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)β128Updated 11 months ago
- Repository for βPlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makersβ, NAACL24β151Updated last year
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024β81Updated last year
- Multi-Granularity LLM Debugger [ICSE2026]β96Updated 7 months ago
- β131Updated 9 months ago
- β87Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".β69Updated last year
- Enhancing AI Software Engineering with Repository-level Code Graphβ248Updated 10 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.β69Updated 7 months ago
- β80Updated 4 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.β96Updated 8 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"β120Updated 3 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoningβ96Updated 2 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systemsβ125Updated 7 months ago
- Run SWE-bench evaluations remotelyβ51Updated 5 months ago
- β84Updated last year
- [ACL 2025] Agentic Knowledgeable Self-awarenessβ91Updated 7 months ago
- [EMNLP 2024] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.β147Updated last year
- Official Repo for CRMArena and CRMArena-Proβ132Updated this week
- [NAACL2025] LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applicationsβ143Updated 6 months ago
- β69Updated last year
- β39Updated last year