DeepSoftwareAnalytics / Awesome-Agent4SELinks
β99Updated last year
Alternatives and similar repositories for Awesome-Agent4SE
Users that are interested in Awesome-Agent4SE are comparing it to the libraries listed below
Sorting:
- Beating the GAIA benchmark with Transformers Agents. πβ136Updated 6 months ago
- β111Updated 3 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)β120Updated 7 months ago
- π§ Compare how Agent systems perform on several benchmarks. ππβ102Updated last month
- CodeSage: Code Representation Learning At Scale (ICLR 2024)β112Updated 10 months ago
- β56Updated 2 months ago
- β116Updated 4 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agentsβ127Updated last year
- Repository for βPlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makersβ, NAACL24β145Updated last year
- AWM: Agent Workflow Memoryβ316Updated 7 months ago
- Official Code for Oα΄α΄Ι΄-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models (EMNLP Findings 2024)β132Updated 6 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)β185Updated 2 weeks ago
- β159Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systemsβ105Updated 3 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"β116Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ111Updated 5 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"β227Updated 2 months ago
- accompanying material for sleep-time compute paperβ108Updated 4 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)β92Updated 7 months ago
- Official Repo for CRMArena and CRMArena-Proβ114Updated 2 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.β93Updated 3 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoningβ61Updated 2 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".β69Updated last year
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".β240Updated last year
- LLM reads a paper and produce a working prototypeβ57Updated 5 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024β74Updated last year
- β59Updated 9 months ago
- β78Updated last year
- β204Updated last year
- Enhancing AI Software Engineering with Repository-level Code Graphβ213Updated 5 months ago