☆380Mar 10, 2023Updated 3 years ago
Alternatives and similar repositories for toolformer
Users who are interested in toolformer are comparing it to the libraries listed below.
- A Python implementation of Toolformer using Huggingface Transformers☆14Mar 20, 2023Updated 3 years ago
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆144Apr 5, 2023Updated 2 years ago
- Framework for fine-tuning the ToolFormer-based LM in a few-shot manner☆25Nov 11, 2023Updated 2 years ago
- React app implementing OpenAI and Google APIs to re-create behavior of the toolformer paper.☆232Apr 6, 2023Updated 2 years ago
- ☆12Nov 15, 2022Updated 3 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,739Jan 8, 2024Updated 2 years ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,629Sep 15, 2023Updated 2 years ago
- ☆173Jun 27, 2023Updated 2 years ago
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.☆5,559May 21, 2025Updated 10 months ago
- A repository for transformer critique learning and generation☆89Dec 7, 2023Updated 2 years ago
- ☆1,058May 29, 2023Updated 2 years ago
- Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins☆2,785Dec 5, 2023Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆209Jan 13, 2024Updated 2 years ago
- Aligning pretrained language models with instruction data generated by themselves.☆4,587Mar 27, 2023Updated 2 years ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,774Mar 11, 2026Updated last week
- This repo explores how to use AMR to address tasks that are difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆4,289Updated this week
- Code base of In-Context Learning for Dialogue State tracking☆45Sep 24, 2023Updated 2 years ago
- Knowledge Graph Simple Question Answering for Unseen Domains☆13Jul 2, 2025Updated 8 months ago
- [ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models☆3,672Feb 6, 2024Updated 2 years ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆3,253Feb 8, 2026Updated last month
- Alpaca dataset from Stanford, cleaned and curated☆1,583Mar 7, 2026Updated 2 weeks ago
- [NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning☆3,100Jan 14, 2025Updated last year
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆2,106Jun 1, 2023Updated 2 years ago
- An open-source framework for training large multimodal models.☆4,079Aug 31, 2024Updated last year
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)☆3,986Jun 12, 2024Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆70Dec 9, 2024Updated last year
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆43Mar 14, 2024Updated 2 years ago
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.☆4,929Dec 7, 2024Updated last year
- ☆180Feb 23, 2023Updated 3 years ago
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,832Jun 17, 2025Updated 9 months ago
- Code accompanying the paper Pretraining Language Models with Human Preferences☆180Feb 13, 2024Updated 2 years ago
- ☆80Mar 24, 2025Updated 11 months ago
- Instruction Tuning with GPT-4☆4,338Jun 11, 2023Updated 2 years ago
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"☆1,065Mar 7, 2024Updated 2 years ago
- ☆124Feb 21, 2025Updated last year
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries☆7,400Feb 3, 2026Updated last month
- ☆24Sep 3, 2024Updated last year
- ☆27Jun 6, 2024Updated last year