DeepSoftwareAnalytics / Awesome-Agent4SE
★101 Updated last year
Alternatives and similar repositories for Awesome-Agent4SE
Users who are interested in Awesome-Agent4SE are comparing it to the repositories listed below.
- Beating the GAIA benchmark with Transformers Agents. ★138 Updated 8 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038) ★188 Updated 2 months ago
- CodeSage: Code Representation Learning At Scale (ICLR 2024) ★114 Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ★128 Updated last year
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ★72 Updated this week
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ★109 Updated 5 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ★242 Updated last year
- ★121 Updated 5 months ago
- ★84 Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ★122 Updated 9 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ★149 Updated last year
- LIMI: Less is More for Agency ★148 Updated last month
- ★60 Updated 4 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data ★177 Updated this week
- Official Code for Oᴘᴇɴ-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models (EMNLP Findings 2024) ★139 Updated 8 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ★116 Updated 3 weeks ago
- Code for ScribeAgent paper ★64 Updated 8 months ago
- ★79 Updated last month
- [EMNLP 2024] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs. ★148 Updated last year
- ★160 Updated last year
- Compare how Agent systems perform on several benchmarks. ★102 Updated 3 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models. ★94 Updated 6 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ★70 Updated last year
- Official Repo for CRMArena and CRMArena-Pro ★123 Updated last week
- AWM: Agent Workflow Memory ★353 Updated 9 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you… ★79 Updated 10 months ago
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers" ★90 Updated 2 weeks ago
- Enhancing AI Software Engineering with Repository-level Code Graph ★225 Updated 7 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ★91 Updated 9 months ago
- Simple examples using Argilla tools to build AI ★56 Updated 11 months ago