DeepSoftwareAnalytics / Awesome-Agent4SE
☆93 · Updated 7 months ago
Alternatives and similar repositories for Awesome-Agent4SE:
Users who are interested in Awesome-Agent4SE are comparing it to the repositories listed below.
- Beating the GAIA benchmark with Transformers Agents. ☆113 · Updated 2 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆66 · Updated 10 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆122 · Updated 10 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024 ☆68 · Updated 8 months ago
- Compare how Agent systems perform on several benchmarks. ☆95 · Updated 6 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆76 · Updated last month
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆109 · Updated 2 months ago
- Code for ScribeAgent paper ☆57 · Updated 2 months ago
- LLM reads a paper and produces a working prototype ☆52 · Updated 3 weeks ago
- ☆155 · Updated 8 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 · Updated 3 months ago
- Repository for "PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers", NAACL24 ☆139 · Updated 10 months ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆104 · Updated 4 months ago
- ☆79 · Updated 2 weeks ago
- AWM: Agent Workflow Memory ☆268 · Updated 3 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆78 · Updated 7 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph ☆159 · Updated last month
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆107 · Updated 7 months ago
- ☆85 · Updated last week
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆90 · Updated last month
- Testing speed and accuracy of RAG with and without Cross Encoder Reranker. ☆48 · Updated last year
- ☆41 · Updated 4 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ☆53 · Updated last month
- ☆50 · Updated 5 months ago
- Simple examples using Argilla tools to build AI ☆52 · Updated 5 months ago
- ☆74 · Updated 3 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM. ☆44 · Updated 2 weeks ago
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments ☆54 · Updated 2 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆105 · Updated 3 weeks ago
- CodeSage: Code Representation Learning At Scale (ICLR 2024) ☆103 · Updated 6 months ago