The official code for "ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases"
☆885 Oct 26, 2024 Updated last year
Alternatives and similar repositories for ToolAlpaca
Users that are interested in ToolAlpaca are comparing it to the libraries listed below.
- [ICLR'24] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use ☆114 Mar 21, 2024 Updated 2 years ago
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning. ☆5,665 May 21, 2025 Updated last year
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆74 May 13, 2025 Updated last year
- ☆922 Jul 24, 2024 Updated last year
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step ☆310 Apr 3, 2024 Updated 2 years ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ☆287 Aug 19, 2023 Updated 2 years ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench. ☆235 Apr 15, 2025 Updated last year
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios ☆71 Aug 5, 2025 Updated 10 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities. ☆179 Feb 28, 2024 Updated 2 years ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs ☆1,497 Oct 31, 2023 Updated 2 years ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24) ☆3,492 Feb 8, 2026 Updated 4 months ago
- Paper collection on building and evaluating language model agents via executable language grounding ☆365 Apr 29, 2024 Updated 2 years ago
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning ☆15 May 13, 2025 Updated last year
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin… ☆2,797 Dec 12, 2023 Updated 2 years ago
- DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI. ☆1,558 Updated this week
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha… ☆140 Jun 4, 2024 Updated 2 years ago
- Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins ☆2,773 Dec 5, 2023 Updated 2 years ago
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral) ☆271 Apr 18, 2024 Updated 2 years ago
- Official code for AAAI2023 paper `Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum` ☆47 Feb 9, 2025 Updated last year
- ☆164 Apr 17, 2023 Updated 3 years ago
- GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the u… ☆770 Dec 19, 2023 Updated 2 years ago
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav… ☆323 Sep 29, 2023 Updated 2 years ago
- This is the repository for the Tool Learning survey. ☆484 Aug 9, 2025 Updated 10 months ago
- Collection of papers for scalable automated alignment. ☆92 Oct 22, 2024 Updated last year
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral] ☆418 May 20, 2024 Updated 2 years ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents ☆551 Sep 6, 2024 Updated last year
- ChatGLM3 series: Open Bilingual Chat LLMs ☆13,677 Jan 13, 2025 Updated last year
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast. ☆1,994 Aug 9, 2025 Updated 10 months ago
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning ☆10 Nov 3, 2024 Updated last year
- A large-scale 7B pretraining language model developed by BaiChuan-Inc. ☆5,654 Jul 18, 2024 Updated last year
- Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, … ☆6,639 Oct 24, 2024 Updated last year
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,895 Apr 13, 2026 Updated 2 months ago
- Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pretraining and instruction-tuning datasets ☆3,049 Apr 14, 2024 Updated 2 years ago
- ☆244 Aug 14, 2024 Updated last year
- Aligning pretrained language models with instruction data generated by themselves. ☆4,600 Mar 27, 2023 Updated 3 years ago
- ☆11 Jun 11, 2024 Updated 2 years ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆72,107 Updated this week
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models. ☆26 Nov 6, 2024 Updated last year
- FireAct: Toward Language Agent Fine-tuning ☆294 Oct 22, 2023 Updated 2 years ago