destilabs / optimautogptLinks
Framework for finetunning the ToolFormer-based LM in a few shots manner
☆25Updated 2 years ago
Alternatives and similar repositories for optimautogpt
Users that are interested in optimautogpt are comparing it to the libraries listed below
Sorting:
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- Evaluating tool-augmented LLMs in conversation settings☆88Updated last year
- Based on the tree of thoughts paper☆48Updated 2 years ago
- Reward Model framework for LLM RLHF☆62Updated 2 years ago
- ☆84Updated 2 years ago
- The data processing pipeline for the Koala chatbot language model☆118Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated 2 years ago
- ☆279Updated 2 years ago
- ☆86Updated 2 years ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning☆202Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification☆112Updated last year
- This repository implements the chain of verification paper by Meta AI☆195Updated 2 years ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆120Updated 3 months ago
- 📚 Datasets and models for instruction-tuning☆238Updated 2 years ago
- ☆128Updated 2 years ago
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆144Updated 2 years ago
- Adversarial Training and SFT for Bot Safety Models☆40Updated 2 years ago
- ☆186Updated last year
- ☆33Updated 2 years ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆99Updated 4 months ago
- Weekly visualization report of Open LLM model performance based on 4 metrics.☆86Updated 2 years ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆91Updated 5 months ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆68Updated 2 years ago
- create workflows with LLMs☆55Updated last year
- ☆380Updated 2 years ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆100Updated 2 years ago
- SAIL: Search Augmented Instruction Learning☆158Updated 6 months ago
- ☆75Updated 2 years ago
- A re-implementation of Meta-Prompt in LangChain for building self-improving agents.☆64Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago