destilabs / optimautogpt
Framework for finetunning the ToolFormer-based LM in a few shots manner
☆24Updated last year
Alternatives and similar repositories for optimautogpt:
Users that are interested in optimautogpt are comparing it to the libraries listed below
- Based on the tree of thoughts paper☆47Updated last year
- ☆49Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆39Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆102Updated 3 months ago
- ☆37Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆41Updated last year
- Evaluating tool-augmented LLMs in conversation settings☆82Updated 10 months ago
- Adversarial Training and SFT for Bot Safety Models☆39Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 5 months ago
- ☆12Updated last week
- ☆33Updated last year
- A re-implementation of Meta-Prompt in LangChain for building self-improving agents.☆63Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆166Updated last year
- LLM reads a paper and produce a working prototype☆51Updated 2 weeks ago
- ☆84Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆52Updated 4 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆80Updated last year
- Reward Model framework for LLM RLHF☆61Updated last year
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆87Updated last year
- A framework for evaluating function calls made by LLMs☆37Updated 8 months ago
- Here is a Google Colab Notebook for fine-tuning Alpaca Lora (within 3 hours with a 40GB A100 GPU)☆38Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Track the progress of LLM context utilisation☆54Updated 8 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆22Updated last year
- A repository for transformer critique learning and generation☆89Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated last year