tangqiaoyu / ToolAlpaca
The official code for "ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases"
☆883Oct 26, 2024Updated last year
Alternatives and similar repositories for ToolAlpaca
Users that are interested in ToolAlpaca are comparing it to the libraries listed below
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆108Mar 21, 2024Updated last year
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.☆5,525May 21, 2025Updated 8 months ago
- ☆917Jul 24, 2024Updated last year
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆304Apr 3, 2024Updated last year
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆73May 13, 2025Updated 9 months ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆285Aug 19, 2023Updated 2 years ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆214Apr 15, 2025Updated 10 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆172Feb 28, 2024Updated last year
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,477Oct 31, 2023Updated 2 years ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆69Aug 5, 2025Updated 6 months ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆3,162Feb 8, 2026Updated last week
- This is the repository for the Tool Learning survey.☆478Aug 9, 2025Updated 6 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆392May 20, 2024Updated last year
- Paper collection on building and evaluating language model agents via executable language grounding☆365Apr 29, 2024Updated last year
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…☆2,799Dec 12, 2023Updated 2 years ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…☆133Jun 4, 2024Updated last year
- DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.☆1,526Jan 22, 2026Updated 3 weeks ago
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning☆15May 13, 2025Updated 9 months ago
- Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning☆49Aug 27, 2023Updated 2 years ago
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆24Nov 6, 2024Updated last year
- Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins☆2,787Dec 5, 2023Updated 2 years ago
- ☆164Apr 17, 2023Updated 2 years ago
- ☆31Jun 12, 2024Updated last year
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆318Sep 29, 2023Updated 2 years ago
- GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the u…☆773Dec 19, 2023Updated 2 years ago
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,686Jul 18, 2024Updated last year
- Collection of papers for scalable automated alignment.☆93Oct 22, 2024Updated last year
- ChatGLM3 series: Open Bilingual Chat LLMs☆13,757Jan 13, 2025Updated last year
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.☆1,946Aug 9, 2025Updated 6 months ago
- Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pretraining/instruction-tuning datasets☆3,055Apr 14, 2024Updated last year
- Aligning pretrained language models with instruction data generated by themselves.☆4,571Mar 27, 2023Updated 2 years ago
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 5 months ago
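- chatglm-6b fine-tuning/LoRA/PPO/inference; samples are automatically generated integer and decimal arithmetic (addition, subtraction, multiplication, division); runs on GPU or CPU☆165Aug 24, 2023Updated 2 years ago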
- TigerBot: A multi-language multi-task LLM☆2,262Dec 28, 2024Updated last year
- An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks☆2,076Nov 16, 2023Updated 2 years ago
- Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, …☆6,635Oct 24, 2024Updated last year
- Official code for AAAI2023 paper `Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum`☆45Feb 9, 2025Updated last year
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆477Sep 6, 2024Updated last year
- Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)☆2,694Aug 14, 2024Updated last year