ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)
☆271Apr 18, 2024Updated last year
Alternatives and similar repositories for ToolkenGPT
Users that are interested in ToolkenGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆73May 13, 2025Updated 11 months ago
- ☆919Jul 24, 2024Updated last year
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆225Apr 15, 2025Updated last year
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.☆5,594May 21, 2025Updated 10 months ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆43Mar 14, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Reasoning with Language Model is Planning with World Model☆191Aug 25, 2023Updated 2 years ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆285Aug 19, 2023Updated 2 years ago
- A library for advanced large language model reasoning☆2,339Jun 10, 2025Updated 10 months ago
- Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework☆12May 7, 2025Updated 11 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆173Feb 28, 2024Updated 2 years ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆513Oct 9, 2024Updated last year
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".☆1,138Dec 23, 2023Updated 2 years ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆28Mar 14, 2024Updated 2 years ago
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆3,316Feb 8, 2026Updated 2 months ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Oct 11, 2024Updated last year
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI☆2,056Jul 22, 2024Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- ☆133Jul 10, 2024Updated last year
- Paper collection on building and evaluating language model agents via executable language grounding☆365Apr 29, 2024Updated last year
- The source code of the Sudowoodo paper in ICDE 2023☆18May 24, 2023Updated 2 years ago
- List of language agents based on paper "Cognitive Architectures for Language Agents"☆1,196Jan 16, 2025Updated last year
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆246May 5, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Dec 29, 2024Updated last year
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆95Jan 21, 2024Updated 2 years ago
- [ICLR'24] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆112Mar 21, 2024Updated 2 years ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆975Nov 5, 2025Updated 5 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆325Oct 22, 2024Updated last year
- ☆35May 24, 2025Updated 10 months ago
- This repository collects awesome representative papers and resources for "From Pre-training to Post-training: A Survey on Time Series Fou…☆33Feb 1, 2026Updated 2 months ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Aug 19, 2023Updated 2 years ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆306Apr 3, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆66Oct 18, 2024Updated last year
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- ☆123Jun 6, 2024Updated last year
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate☆520Apr 24, 2025Updated 11 months ago
- Code repo for EMNLP 2023 paper "Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models"☆23Nov 13, 2023Updated 2 years ago
- ☆14Oct 11, 2023Updated 2 years ago
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…☆1,114Feb 22, 2024Updated 2 years ago