destilabs / optimautogpt
Framework for finetunning the ToolFormer-based LM in a few shots manner
☆24Updated last year
Alternatives and similar repositories for optimautogpt:
Users that are interested in optimautogpt are comparing it to the libraries listed below
- ☆49Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆164Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆38Updated 11 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆67Updated 4 months ago
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆37Updated 2 weeks ago
- Based on the tree of thoughts paper☆46Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆94Updated last year
- Evaluating tool-augmented LLMs in conversation settings☆77Updated 8 months ago
- Here is a Google Colab Notebook for fine-tuning Alpaca Lora (within 3 hours with a 40GB A100 GPU)☆38Updated last year
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆136Updated last year
- ☆84Updated last year
- ☆33Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆100Updated 5 months ago
- Reward Model framework for LLM RLHF☆60Updated last year
- CLIR version of ColBERT☆67Updated 4 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆77Updated 10 months ago
- Writing Blog Posts with Generative Feedback Loops!☆47Updated 11 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆40Updated 10 months ago
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆80Updated last year
- A repository for transformer critique learning and generation☆88Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆107Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆93Updated last year
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆71Updated 2 years ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆51Updated 3 months ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆28Updated 2 years ago
- ☆93Updated 2 months ago