Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools
☆144Apr 5, 2023Updated 2 years ago
Alternatives and similar repositories for toolformer
Users that are interested in toolformer are comparing it to the libraries listed below.
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI☆2,056Jul 22, 2024Updated last year
- ☆380Mar 10, 2023Updated 3 years ago
- ☆12Nov 15, 2022Updated 3 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- React app implementing OpenAI and Google APIs to re-create behavior of the toolformer paper.☆232Apr 6, 2023Updated 2 years ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆43Mar 14, 2024Updated 2 years ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Dec 14, 2023Updated 2 years ago
- ☆12Mar 12, 2021Updated 5 years ago
- This repo explores how AMR can address tasks difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- ☆11Jan 13, 2013Updated 13 years ago
- Single-header logger with pretty console output☆20Mar 23, 2026Updated last week
- Seq2seqAttGeneration, a basic implementation of text generation that uses a seq2seq attention model to generate poem series. This project…☆18Jan 11, 2021Updated 5 years ago
- Python package for rematerialization-aware gradient checkpointing☆27Oct 31, 2023Updated 2 years ago
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Nov 27, 2024Updated last year
- ☆24Sep 3, 2024Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆174Apr 7, 2023Updated 2 years ago
- ChatGPT API Usage using LangChain, LlamaIndex, Guardrails, AutoGPT and more☆126Aug 16, 2024Updated last year
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins☆2,782Dec 5, 2023Updated 2 years ago
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?"☆21Oct 13, 2020Updated 5 years ago
- I added selfplay functionality to openai gyms☆10Jan 16, 2021Updated 5 years ago
- Implementation of the user-space eBPF VM based on the iovisor version (https://github.com/iovisor/ubpf)☆13Apr 16, 2020Updated 5 years ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 3 years ago
- A web UI for LangChainHub, built on Next.js☆41Jan 28, 2023Updated 3 years ago
- DescriptionPairsExtraction, a program that extracts entity and description pairs, based on Albert and data back-annotation. Based on Albert and structured-data back-annotation…☆20Mar 7, 2022Updated 4 years ago
- Data augmentation demo for NLP☆48Feb 28, 2020Updated 6 years ago
- Karras et al. (2022) diffusion models for PyTorch☆12Aug 23, 2022Updated 3 years ago
- Generate, compile and run .java source dynamically at runtime☆11Apr 23, 2019Updated 6 years ago
- Hybrid Deep Sequential Modeling for Social Text-Driven Stock Prediction-Dataset☆22Aug 19, 2018Updated 7 years ago
- A simple Langchain agent setup that makes it easy to test out new agent tools.☆15Apr 8, 2023Updated 2 years ago
- ☆24Apr 3, 2025Updated 11 months ago
- This repository contains source code for the PASTA model, a pre-trained language model for table-based fact verification.☆18Dec 27, 2022Updated 3 years ago
- Parallel Self-Adjusting Computation☆16Jul 5, 2021Updated 4 years ago
- MOSS 003 WebSearchTool: A simple but reliable implementation☆45May 24, 2023Updated 2 years ago
- Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks☆609Apr 11, 2023Updated 2 years ago
- ☆173Jun 27, 2023Updated 2 years ago
- Procgen2: A community maintained fork of procgen☆12Aug 25, 2022Updated 3 years ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 4 years ago