salesforce / BOLAA
☆181Updated 4 months ago
Alternatives and similar repositories for BOLAA
Users who are interested in BOLAA are comparing it to the libraries listed below.
- FireAct: Toward Language Agent Fine-tuning☆278Updated last year
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆318Updated last year
- ☆121Updated 11 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆309Updated 7 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆224Updated 4 months ago
- [NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents☆223Updated 4 months ago
- ☆142Updated last year
- An implementation of Everything of Thoughts (XoT).☆140Updated last year
- Reasoning with Language Model is Planning with World Model☆166Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆141Updated 7 months ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆263Updated last year
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆151Updated last year
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆97Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆121Updated last year
- ☆172Updated last year
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆145Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆136Updated 6 months ago
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆86Updated last year
- augmented LLM with self reflection☆122Updated last year
- Data and Code for Program of Thoughts (TMLR 2023)☆274Updated last year
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆351Updated 8 months ago
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)☆262Updated last year
- ☆269Updated 2 years ago
- ☆229Updated 9 months ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…☆124Updated 11 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆96Updated last year
- A benchmark list for evaluation of large language models.☆116Updated last month
- Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"☆282Updated 7 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆108Updated 2 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆110Updated 8 months ago