OSU-NLP-Group / TravelPlannerLinks
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
☆383Updated 2 weeks ago
Alternatives and similar repositories for TravelPlanner
Users that are interested in TravelPlanner are comparing it to the libraries listed below
Sorting:
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆326Updated last year
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e…☆493Updated 3 months ago
- papers related to LLM-agent that published on top conferences☆315Updated 2 months ago
- This is the repository for the Tool Learning survey.☆395Updated last month
- FireAct: Toward Language Agent Fine-tuning☆279Updated last year
- ☆256Updated last year
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆573Updated last month
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆638Updated 5 months ago
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆240Updated last year
- An extensible benchmark for evaluating large language models on planning☆384Updated this week
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆215Updated last month
- Towards Large Multimodal Models as Visual Foundation Agents☆216Updated 2 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆228Updated 5 months ago
- ☆136Updated 6 months ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆269Updated last year
- ☆222Updated last year
- [NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents☆228Updated 4 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆156Updated 2 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆566Updated last month
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆358Updated 9 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆276Updated last year
- RewardBench: the first evaluation tool for reward models.☆604Updated 2 weeks ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆145Updated 3 months ago
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.☆533Updated 7 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆141Updated 6 months ago
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)☆262Updated last year
- ☆541Updated 5 months ago
- Survey on LLM Agents (Published on CoLing 2025)