lapisrocks / LanguageAgentTreeSearch
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
☆754 Updated 9 months ago
Alternatives and similar repositories for LanguageAgentTreeSearch
Users who are interested in LanguageAgentTreeSearch are comparing it to the libraries listed below
- Code for Quiet-STaR ☆731 Updated 8 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆464 Updated last year
- ☆931 Updated 3 months ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents ☆339 Updated 8 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆632 Updated last month
- List of language agents based on paper "Cognitive Architectures for Language Agents" ☆946 Updated 4 months ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively. ☆685 Updated 7 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,381 Updated last year
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate ☆430 Updated 3 weeks ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks ☆308 Updated 6 months ago
- A library for advanced large language model reasoning ☆2,122 Updated last month
- Implementation of Google's SELF-DISCOVER ☆295 Updated 9 months ago
- ☆1,019 Updated 4 months ago
- Autonomous Agents (LLMs) research papers. Updated Daily. ☆798 Updated this week
- xLAM: A Family of Large Action Models to Empower AI Agent Systems ☆432 Updated this week
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,152 Updated last year
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning ☆653 Updated 11 months ago
- RewardBench: the first evaluation tool for reward models. ☆566 Updated last week
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents" ☆985 Updated 3 months ago
- ☆596 Updated 4 months ago
- LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step ☆526 Updated 8 months ago
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding ☆382 Updated last year
- An extensible benchmark for evaluating large language models on planning ☆361 Updated 3 weeks ago
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e… ☆462 Updated 2 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral] ☆314 Updated 11 months ago
- Official repository for ORPO ☆452 Updated 11 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆840 Updated last week
- Code and Data for Tau-Bench ☆485 Updated 3 months ago
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them ☆488 Updated 10 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ☆703 Updated last week