xrsrke/toolformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xrsrke/toolformer)

xrsrke / toolformer

Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools

☆146

Alternatives and similar repositories for toolformer

Users that are interested in toolformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / toolformer-pytorch
View on GitHub
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
☆2,062Jul 22, 2024Updated 2 years ago
conceptofmind / toolformer
View on GitHub
☆385Mar 10, 2023Updated 3 years ago
thunlp / EREN
View on GitHub
Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1
☆14Mar 27, 2024Updated 2 years ago
mrcabbage972 / simple-toolformer
View on GitHub
A Python implementation of Toolformer using Huggingface Transformers
☆14Mar 20, 2023Updated 3 years ago
eric-mitchell / concord
View on GitHub
☆14Nov 15, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
allenai / label_rationale_association
View on GitHub
Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"
☆12Sep 12, 2023Updated 2 years ago
minosvasilias / toolformer-zero
View on GitHub
React app implementing OpenAI and Google APIs to re-create behavior of the toolformer paper.
☆231Apr 6, 2023Updated 3 years ago
zjunlp / TRICE
View on GitHub
[NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback
☆43Mar 14, 2024Updated 2 years ago
xrsrke / pipegoose
View on GitHub
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆87Dec 14, 2023Updated 2 years ago
veerarc / RelationExtraction_keras
View on GitHub
☆12Mar 12, 2021Updated 5 years ago
CyberAgentAILab / filtered-dpo
View on GitHub
[EMNLP 2024] Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by …
☆16Nov 27, 2024Updated last year
nakagami / reportlab
View on GitHub
☆12Jul 5, 2026Updated 3 weeks ago
BUPT-LawLLM / LawLLM
View on GitHub
该仓库是 BUPT 智能系统实验室的法律大模型项目，基于 ChatGLM 等开源大模型进行实现。
☆11Nov 28, 2023Updated 2 years ago
liuhuanyong / Seq2seqAttGeneration
View on GitHub
Seq2seqAttGeneration, an basic implementation of text generation that using seq2seq attention model to generate poem series. this project…
☆18Jan 11, 2021Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
OpenBMB / BMTools
View on GitHub
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
☆2,770Dec 5, 2023Updated 2 years ago
peterbhase / LAS-NL-Explanations
View on GitHub
Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?"
☆21Oct 13, 2020Updated 5 years ago
FranxYao / Complexity-Based-Prompting
View on GitHub
Complexity Based Prompting for Multi-Step Reasoning
☆17Mar 10, 2023Updated 3 years ago
jeffhj / domain-relevance
View on GitHub
The implementation for "Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach" (ACL '21)
☆16Jun 13, 2021Updated 5 years ago
p-quic / ubpf
View on GitHub
Implementation of the user-space eBPF VM based on the iovisor version (https://github.com/iovisor/ubpf)
☆13Apr 16, 2020Updated 6 years ago
Ibrahimmohammedevuti / Binance-Arbitrage-Scanner-and-Auto-Trader
View on GitHub
This is an arbitrage scanner and auto trading bot created using python code, this bot works for Bybit, Binance, Kucoin, OKX and Bitget, y…
☆10Apr 27, 2024Updated 2 years ago
wuhuizhe / CHRNN
View on GitHub
Hybrid Deep Sequential Modeling for Social Text-Driven Stock Prediction-Dataset
☆22Aug 19, 2018Updated 7 years ago
FanaHOVA / langchain-hub-ui
View on GitHub
A web UI for LangChainHub, built on Next.js
☆41Jan 28, 2023Updated 3 years ago
PlusLabNLP / GENEVA
View on GitHub
☆12Aug 15, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wepe / MachineLearningEveryday
View on GitHub
record and share my reading everyday
☆12Apr 1, 2016Updated 10 years ago
msakuta / VastSpace
View on GitHub
Space war simulation game engine in real and vast scale
☆14Nov 30, 2021Updated 4 years ago
jina-ai / agentchain
View on GitHub
Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks
☆610Apr 11, 2023Updated 3 years ago
ruc-datalab / PASTA
View on GitHub
This repository contains source code for the PASTA model, a pre-trained language model for table-based fact verification.
☆18Dec 27, 2022Updated 3 years ago
OSU-NLP-Group / reversal-curse-binding
View on GitHub
☆25Apr 3, 2025Updated last year
OpenLMLab / MOSS_WebSearchTool
View on GitHub
MOSS 003 WebSearchTool: A simple but reliable implementation
☆45May 24, 2023Updated 3 years ago
RocioLiu / bert_chinese_ner
View on GitHub
Implementing BERT + CRF with PyTorch for Chinese NER.
☆10Mar 7, 2022Updated 4 years ago
bhargaviparanjape / language-programmes
View on GitHub
☆173Jun 27, 2023Updated 3 years ago
Farama-Foundation / Procgen-Staging
View on GitHub
Procgen2: A community maintained fork of procgen
☆12Aug 25, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
StonyBrookNLP / tellmewhy
View on GitHub
Website for release of TellMeWhy dataset for why question answering
☆14Nov 11, 2022Updated 3 years ago
SciFin-Team / SciFin
View on GitHub
SciFin is a python package for Science & Finance.
☆11Oct 25, 2020Updated 5 years ago
astordu / r1-reasoning-rag
View on GitHub
recursive rag with r1 reasoning
☆11Feb 26, 2025Updated last year
mathigatti / okCupidScraper
View on GitHub
Download okCupid users public data automatically
☆10Feb 6, 2022Updated 4 years ago
HustMinsLab / BIP
View on GitHub
☆17Nov 28, 2022Updated 3 years ago
SeokwonJung-Jay / MEME-public
View on GitHub
MEME: Multi-Entity & Evolving Memory Evaluation — reference implementation (companion to arXiv preprint)
☆23May 11, 2026Updated 2 months ago
PluviophileYU / CVC-QA
View on GitHub
Code for "Counterfactual Variable Control for Robust and Interpretable Question Answering"
☆14Oct 13, 2020Updated 5 years ago