AngxiaoYue / awesome-llm-tool-learning
A list of awesome papers on LLM tool learning.
☆23Updated 6 months ago
Alternatives and similar repositories for awesome-llm-tool-learning:
Users that are interested in awesome-llm-tool-learning are comparing it to the libraries listed below
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆128Updated 5 months ago
- This is the repository for the Tool Learning survey.☆306Updated this week
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆128Updated 9 months ago
- Official code for AAAI2023 paper`Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum`☆17Updated 2 weeks ago
- ☆20Updated last week
- A series of technical report on Slow Thinking with LLM☆411Updated last week
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆104Updated 5 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆207Updated last month
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 2 months ago
- The awesome agents in the era of large language models☆59Updated last year
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆111Updated 3 months ago
- The demo, code and data of FollowRAG☆69Updated 2 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆119Updated 7 months ago
- The related works and background techniques about Openai o1☆211Updated last month
- papers related to LLM-agent that published on top conferences☆311Updated last year
- ☆45Updated 4 months ago
- Building a comprehensive and handy list of papers for GUI agents☆215Updated this week
- ☆65Updated last year
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆52Updated 10 months ago
- ☆174Updated 9 months ago
- Code and Data Repo for [ICLR 2025] Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆20Updated 2 months ago
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)☆107Updated 2 months ago
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆331Updated last month
- The code and data of DPA-RAG☆56Updated last month
- RAG methods, benchmarks, and toolkits☆12Updated 2 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆155Updated last month
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆173Updated 9 months ago
- ☆258Updated 6 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆290Updated 6 months ago