arunprsh / ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
A Practical Guide to Developing a Reliable FAQ Chatbot with Reinforcement Learning and Human Feedback using GPT-2 on AWS
☆12 · Updated last year
Related projects
Alternatives and complementary repositories for ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
- Finetune BLOOM ☆40 · Updated last year
- ☆46 · Updated last year
- Scripts for fine-tuning Llama2 via SFT and DPO. ☆179 · Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs… ☆183 · Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆64 · Updated 3 weeks ago
- ☆68 · Updated last year
- ☆124 · Updated last year
- ☆121 · Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human… ☆207 · Updated 5 months ago
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human… ☆56 · Updated last year
- 💭 Fine-tune a Covid-19 Doctor-like chatbot with GPT2 ☆49 · Updated 3 years ago
- Implementation of PersonaGPT Dialog Model ☆101 · Updated 3 years ago
- Multilingual/multidomain question generation datasets, models, and python library for question generation. ☆324 · Updated 2 months ago
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI ☆477 · Updated 6 months ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗 ☆196 · Updated 3 years ago
- Define Transformers, T5 model and RoBERTa Encoder decoder model for product names generation ☆48 · Updated 3 years ago
- ☆39 · Updated last year
- ☆94 · Updated 3 years ago
- Fine-tuning GPT-2 Small for Question Answering ☆130 · Updated last year
- All available datasets for Instruction Tuning of Large Language Models ☆236 · Updated 11 months ago
- Here is a Google Colab Notebook for fine-tuning Alpaca Lora (within 3 hours with a 40GB A100 GPU) ☆38 · Updated last year
- This repository contains the code to train flan t5 with alpaca instructions and low rank adaptation. ☆47 · Updated last year
- Source code to reproduce results of our paper "DIET: Lightweight Language Understanding for Dialogue Systems" ☆61 · Updated 4 years ago
- Crosslingual Generalization through Multitask Finetuning ☆515 · Updated last month
- Open efforts to implement ChatGPT-like models and beyond. ☆105 · Updated 3 months ago
- Long Document Summarization Papers ☆136 · Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF) ☆169 · Updated last year
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ☆239 · Updated last year
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers ☆51 · Updated last year
- What can I do with a LLM model? ☆153 · Updated 5 months ago