tangqiaoyu/ToolAlpaca

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tangqiaoyu/ToolAlpaca)

tangqiaoyu / ToolAlpaca

the official code for "ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases"

☆880

Alternatives and similar repositories for ToolAlpaca

Users that are interested in ToolAlpaca are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenBMB / ToolBench
View on GitHub
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
☆5,708May 21, 2025Updated last year
HowieHwong / MetaTool
View on GitHub
[ICLR'24] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
☆115Mar 21, 2024Updated 2 years ago
Junjie-Ye / ToolEyes
View on GitHub
[COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
☆74May 13, 2025Updated last year
thunlp / ToolLearningPapers
View on GitHub
☆923Jul 24, 2024Updated 2 years ago
THUNLP-MT / StableToolBench
View on GitHub
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
☆237Apr 15, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
open-compass / T-Eval
View on GitHub
[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step
☆312Apr 3, 2024Updated 2 years ago
night-chen / ToolQA
View on GitHub
ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …
☆286Aug 19, 2023Updated 2 years ago
JoeYing1019 / UltraTool
View on GitHub
[ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios
☆71Aug 5, 2025Updated 11 months ago
sambanova / toolbench
View on GitHub
ToolBench, an evaluation suite for LLM tool manipulation capabilities.
☆180Updated this week
THUDM / AgentTuning
View on GitHub
AgentTuning: Enabling Generalized Agent Abilities for LLMs
☆1,500Oct 31, 2023Updated 2 years ago
THUDM / AgentBench
View on GitHub
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
☆3,601Feb 8, 2026Updated 5 months ago
xlang-ai / xlang-paper-reading
View on GitHub
Paper collection on building and evaluating language model agents via executable language grounding
☆364Apr 29, 2024Updated 2 years ago
Junjie-Ye / RoTBench
View on GitHub
[EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
☆15May 13, 2025Updated last year
PhoebusSi / Alpaca-CoT
View on GitHub
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…
☆2,791Dec 12, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AlibabaResearch / DAMO-ConvAI
View on GitHub
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
☆1,569Jun 10, 2026Updated last month
xingyaoww / mint-bench
View on GitHub
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…
☆141Jun 4, 2024Updated 2 years ago
OpenBMB / BMTools
View on GitHub
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
☆2,769Dec 5, 2023Updated 2 years ago
sufengniu / RefGPT
View on GitHub
☆164Apr 17, 2023Updated 3 years ago
Ber666 / ToolkenGPT
View on GitHub
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)
☆271Apr 18, 2024Updated 2 years ago
AILab-CVC / GPT4Tools
View on GitHub
GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the u…
☆771Dec 19, 2023Updated 2 years ago
nexusflowai / NexusRaven
View on GitHub
NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…
☆326Sep 29, 2023Updated 2 years ago
icip-cas / awesome-auto-alignment
View on GitHub
Collection of papers for scalable automated alignment.
☆92Oct 22, 2024Updated last year
quchangle1 / LLM-Tool-Survey
View on GitHub
This is the repository for the Tool Learning survey.
☆485Aug 9, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
hkust-nlp / AgentBoard
View on GitHub
An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]
☆427May 20, 2024Updated 2 years ago
zai-org / ChatGLM3
View on GitHub
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
☆13,670Jan 13, 2025Updated last year
tatsu-lab / alpaca_eval
View on GitHub
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
☆2,007Aug 9, 2025Updated 11 months ago
gao-xiao-bai / JsonTuning
View on GitHub
JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning
☆10Nov 3, 2024Updated last year
open-compass / GTA
View on GitHub
[NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2
☆147Apr 20, 2026Updated 3 months ago
baichuan-inc / Baichuan-7B
View on GitHub
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
☆5,650Jul 18, 2024Updated 2 years ago
yangjianxin1 / Firefly
View on GitHub
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…
☆6,647Oct 24, 2024Updated last year
CVI-SZU / Linly
View on GitHub
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型；ChatFlow中文对话模型；中文OpenLLaMA模型；NLP预训练/指令微调数据集
☆3,046Apr 14, 2024Updated 2 years ago
zorazrw / awesome-tool-llm
View on GitHub
☆248Aug 14, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
yizhongw / self-instruct
View on GitHub
Aligning pretrained language models with instruction data generated by themselves.
☆4,606Mar 27, 2023Updated 3 years ago
Shiguang-Guo / Open-Grounded-Planning
View on GitHub
☆11Jun 11, 2024Updated 2 years ago
ShishirPatil / gorilla
View on GitHub
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
☆12,961Apr 13, 2026Updated 3 months ago
baichuan-inc / Baichuan2
View on GitHub
A series of large language models developed by Baichuan Intelligent Technology
☆4,090Nov 8, 2024Updated last year
quchangle1 / COLT
View on GitHub
The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.
☆26Nov 6, 2024Updated last year
anchen1011 / FireAct
View on GitHub
FireAct: Toward Language Agent Fine-tuning
☆296Oct 22, 2023Updated 2 years ago
open-compass / opencompass
View on GitHub
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …
☆7,235Updated this week