salesforce / BOLAA
☆179Updated 2 months ago
Alternatives and similar repositories for BOLAA:
Users that are interested in BOLAA are comparing it to the libraries listed below
- ☆121Updated 10 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆307Updated 11 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆221Updated 3 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆307Updated 5 months ago
- FireAct: Toward Language Agent Fine-tuning☆275Updated last year
- An implemtation of Everyting of Thoughts (XoT).☆141Updated last year
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆150Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆96Updated last year
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆257Updated last year
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)☆292Updated 7 months ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆114Updated 11 months ago
- The official repository of "ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models"☆42Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆218Updated last year
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆136Updated 11 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆107Updated 2 weeks ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆133Updated 4 months ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆328Updated 7 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆98Updated last year
- [NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents☆215Updated 2 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆134Updated 5 months ago
- Reasoning with Language Model is Planning with World Model☆163Updated last year
- ☆218Updated 8 months ago
- Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"☆276Updated 6 months ago
- ☆172Updated last year
- augmented LLM with self reflection☆118Updated last year
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performance☆68Updated 4 months ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…☆122Updated 10 months ago
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆206Updated last year
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆85Updated last year
- A banchmark list for evaluation of large language models.☆99Updated last month