lapisrocks / LanguageAgentTreeSearchLinks

[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"

☆768

Alternatives and similar repositories for LanguageAgentTreeSearch

Users that are interested in LanguageAgentTreeSearch are comparing it to the libraries listed below

Sorting:

ezelikman / quiet-star
Code for Quiet-STaR
☆737Updated 11 months ago
trotsky1997 / MathBlackBox
☆1,028Updated 7 months ago
madaan / self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
☆719Updated 10 months ago
allenai / lumos
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
☆467Updated last year
YangLing0818 / buffer-of-thought-llm
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
☆652Updated last month
SalesforceAIResearch / xLAM
xLAM: A Family of Large Action Models to Empower AI Agent Systems
☆513Updated this week
princeton-nlp / WebShop
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
☆379Updated 11 months ago
composable-models / llm_multiagent_debate
ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate
☆459Updated 3 months ago
web-arena-x / webarena
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
☆1,084Updated 5 months ago
FloridSleeves / LLMDebugger
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24)
☆554Updated 10 months ago
ysymyth / awesome-language-agents
List of language agents based on paper "Cognitive Architectures for Language Agents"
☆991Updated 6 months ago
catid / self-discover
Implementation of Google's SELF-DISCOVER
☆298Updated 11 months ago
yuchenlin / LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…
☆956Updated 9 months ago
SalesforceAIResearch / AgentLite
☆618Updated 6 months ago
microsoft / Everything-of-Thoughts-XoT
An implemtation of Everyting of Thoughts (XoT).
☆148Updated last year
SwiftSage / SwiftSage
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
☆311Updated 9 months ago
OpenLemur / Lemur
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
☆553Updated last year
google-deepmind / long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".
☆627Updated 3 weeks ago
sierra-research / tau-bench
Code and Data for Tau-Bench
☆713Updated 3 weeks ago
hkust-nlp / AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]
☆335Updated last year
zhentingqi / rStar
☆954Updated 6 months ago
tmgthb / Autonomous-Agents
Autonomous Agents (LLMs) research papers. Updated Daily.
☆906Updated 2 weeks ago
Gentopia-AI / Gentopia
Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents.
☆320Updated last year
dyabel / AnyTool
☆306Updated last year
zorazrw / agent-workflow-memory
AWM: Agent Workflow Memory
☆300Updated 6 months ago
suzgunmirac / meta-prompting
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding
☆397Updated last year
maitrix-org / llm-reasoners
A library for advanced large language model reasoning
☆2,193Updated last month
openai / mle-bench
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
☆823Updated last month
karthikv792 / LLMs-Planning
An extensible benchmark for evaluating large language models on planning
☆393Updated last month
nexusflowai / NexusRaven-V2
☆415Updated last year