lucidrains/toolformer-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucidrains/toolformer-pytorch)

lucidrains / toolformer-pytorch

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

☆2,063

Alternatives and similar repositories for toolformer-pytorch

Users that are interested in toolformer-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

conceptofmind / toolformer
View on GitHub
☆385Mar 10, 2023Updated 3 years ago
xrsrke / toolformer
View on GitHub
Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools
☆146Apr 5, 2023Updated 3 years ago
minosvasilias / toolformer-zero
View on GitHub
React app implementing OpenAI and Google APIs to re-create behavior of the toolformer paper.
☆231Apr 6, 2023Updated 3 years ago
OpenBMB / ToolBench
View on GitHub
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
☆5,715May 21, 2025Updated last year
yizhongw / self-instruct
View on GitHub
Aligning pretrained language models with instruction data generated by themselves.
☆4,607Mar 27, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
tatsu-lab / stanford_alpaca
View on GitHub
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,243Jul 17, 2024Updated 2 years ago
CarperAI / trlx
View on GitHub
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,752Jan 8, 2024Updated 2 years ago
huggingface / peft
View on GitHub
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆21,460Updated this week
ShishirPatil / gorilla
View on GitHub
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
☆12,970Apr 13, 2026Updated 3 months ago
tloen / alpaca-lora
View on GitHub
Instruct-tune LLaMA on consumer hardware
☆18,913Jul 29, 2024Updated 2 years ago
huggingface / trl
View on GitHub
Train transformer language models with reinforcement learning.
☆18,953Updated this week
amazon-science / mm-cot
View on GitHub
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
☆3,987Jun 12, 2024Updated 2 years ago
lm-sys / FastChat
View on GitHub
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆39,508May 1, 2026Updated 2 months ago
nlpxucan / WizardLM
View on GitHub
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
☆9,484Jun 7, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OpenGVLab / LLaMA-Adapter
View on GitHub
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,915Mar 14, 2024Updated 2 years ago
microsoft / unilm
View on GitHub
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆22,173Jan 23, 2026Updated 6 months ago
artidoro / qlora
View on GitHub
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,975Jun 10, 2024Updated 2 years ago
lucidrains / PaLM-rlhf-pytorch
View on GitHub
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
☆7,865Updated this week
microsoft / LMOps
View on GitHub
General technology for enabling AI capabilities w/ LLMs and MLLMs
☆4,450Updated this week
bhargaviparanjape / language-programmes
View on GitHub
☆173Jun 27, 2023Updated 3 years ago
OpenBMB / BMTools
View on GitHub
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
☆2,770Dec 5, 2023Updated 2 years ago
FMInference / FlexLLMGen
View on GitHub
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,364Oct 28, 2024Updated last year
thunlp / ToolLearningPapers
View on GitHub
☆923Jul 24, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
run-llama / llama_index
View on GitHub
LlamaIndex is the leading document agent and OCR platform
☆51,196Updated this week
togethercomputer / RedPajama-Data
View on GitHub
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
☆4,975Jun 3, 2026Updated last month
anthropics / hh-rlhf
View on GitHub
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
☆1,853Jun 17, 2025Updated last year
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
View on GitHub
Instruction Tuning with GPT-4
☆4,333Jun 11, 2023Updated 3 years ago
BlinkDL / RWKV-LM
View on GitHub
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆14,643Jul 23, 2026Updated last week
THUDM / AgentBench
View on GitHub
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
☆3,614Feb 8, 2026Updated 5 months ago
mosaicml / llm-foundry
View on GitHub
LLM training code for Databricks foundation models
☆4,432Mar 25, 2026Updated 4 months ago
noahshinn / reflexion
View on GitHub
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
☆3,214Jan 14, 2025Updated last year
microsoft / JARVIS
View on GitHub
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
☆25,109Jul 29, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
allenai / RL4LMs
View on GitHub
A modular RL library to fine-tune language models to human preferences
☆2,393Mar 1, 2024Updated 2 years ago
Dao-AILab / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆24,568Updated this week
databrickslabs / dolly
View on GitHub
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
☆10,805Jun 30, 2023Updated 3 years ago
OptimalScale / LMFlow
View on GitHub
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
☆8,487May 22, 2026Updated 2 months ago
zai-org / GLM-130B
View on GitHub
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
☆7,656Jul 25, 2023Updated 3 years ago
princeton-nlp / tree-of-thought-llm
View on GitHub
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
☆6,033Jan 16, 2025Updated last year
langchain-ai / langchain
View on GitHub
The agent engineering platform.
☆142,699Updated this week