Ber666/ToolkenGPT

ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)

☆271

Alternatives and similar repositories for ToolkenGPT

Users who are interested in ToolkenGPT are comparing it to the libraries listed below.

Junjie-Ye / ToolEyes
[COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
☆74 · May 13, 2025 · Updated last year
thunlp / ToolLearningPapers
☆923 · Jul 24, 2024 · Updated 2 years ago
OpenBMB / ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
☆5,710 · May 21, 2025 · Updated last year
Ber666 / RAP
Reasoning with Language Model is Planning with World Model
☆197 · Aug 25, 2023 · Updated 2 years ago
zjunlp / TRICE
[NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback
☆43 · Mar 14, 2024 · Updated 2 years ago
night-chen / ToolQA
ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …
☆286 · Aug 19, 2023 · Updated 2 years ago
maitrix-org / llm-reasoners
A library for advanced large language model reasoning
☆2,341 · Jun 10, 2025 · Updated last year
THUNLP-MT / StableToolBench
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
☆238 · Apr 15, 2025 · Updated last year
ysunbp / RECA-paper
Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework
☆12 · May 7, 2025 · Updated last year
sambanova / toolbench
ToolBench, an evaluation suite for LLM tool manipulation capabilities.
☆180 · Updated this week
princeton-nlp / ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
☆523 · Oct 9, 2024 · Updated last year
lupantech / chameleon-llm
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
☆1,140 · Dec 23, 2023 · Updated 2 years ago
Reason-Wang / NAT
[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…
☆28 · Mar 14, 2024 · Updated 2 years ago
THUDM / AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
☆3,603 · Feb 8, 2026 · Updated 5 months ago
kaistAI / KtrlF
[NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"
☆23 · Oct 11, 2024 · Updated last year
lucidrains / toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
☆2,062 · Jul 22, 2024 · Updated 2 years ago
joeljang / FLM
All-in-one repository for Fine-tuning & Pretraining (Large) Language Models
☆15 · Mar 8, 2023 · Updated 3 years ago
megagonlabs / sudowoodo
The source code of the Sudowoodo paper in ICDE 2023
☆19 · May 24, 2023 · Updated 3 years ago
szxiangjn / world-model-for-language-model
☆134 · Jul 10, 2024 · Updated 2 years ago
xlang-ai / xlang-paper-reading
Paper collection on building and evaluating language model agents via executable language grounding
☆364 · Apr 29, 2024 · Updated 2 years ago
princeton-nlp / intercode
[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898
☆254 · May 5, 2024 · Updated 2 years ago
ysymyth / awesome-language-agents
List of language agents based on paper "Cognitive Architectures for Language Agents"
☆1,247 · Jan 16, 2025 · Updated last year
oriyor / reasoning-on-cots
Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"
☆97 · Jan 21, 2024 · Updated 2 years ago
OSU-NLP-Group / Middleware
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)
☆37 · Dec 29, 2024 · Updated last year
hhan1018 / NesTools
[COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models
☆18 · Jan 18, 2025 · Updated last year
OSU-NLP-Group / Mind2Web
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…
☆1,015 · Nov 5, 2025 · Updated 8 months ago
SwiftSage / SwiftSage
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
☆328 · Oct 22, 2024 · Updated last year
HowieHwong / MetaTool
[ICLR'24] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
☆115 · Mar 21, 2024 · Updated 2 years ago
siyuyuan / coscript
Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning
☆36 · Aug 19, 2023 · Updated 2 years ago
yuyq18 / StepTool
☆36 · May 24, 2025 · Updated last year
open-compass / T-Eval
[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step
☆312 · Apr 3, 2024 · Updated 2 years ago
WeiminXiong / IPR
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)
☆68 · Oct 18, 2024 · Updated last year
HKUST-KnowComp / PseudoReasoner
Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…
☆11 · Oct 18, 2022 · Updated 3 years ago
microsoft / simulated-trial-and-error
☆124 · Jun 6, 2024 · Updated 2 years ago
lqtrung1998 / mwp_cot_design
☆14 · Oct 11, 2023 · Updated 2 years ago
ytyz1307zzh / Auto-Instruct
Code repo for EMNLP 2023 paper "Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models"
☆23 · Nov 13, 2023 · Updated 2 years ago
e-spaulding / xpo
☆12 · Jun 18, 2024 · Updated 2 years ago
composable-models / llm_multiagent_debate
ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate
☆544 · Apr 24, 2025 · Updated last year
microsoft / ToRA
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…
☆1,122 · Feb 22, 2024 · Updated 2 years ago