arunprsh / ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
A Practical Guide to Developing a Reliable FAQ Chatbot with Reinforcement Learning and Human Feedback using GPT-2 on AWS
☆12 Updated 2 years ago
Alternatives and similar repositories for ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
Users who are interested in ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO are comparing it to the repositories listed below.
- ☆44 Updated last year
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human… ☆58 Updated 2 years ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human… ☆213 Updated 11 months ago
- This repository is the official implementation of our paper MVP: Multi-task Supervised Pre-training for Natural Language Generation. ☆72 Updated 2 years ago
- A Streamlit app running GPT-2 language model for text classification, built with Pytorch, Transformers and AWS SageMaker. ☆39 Updated 3 years ago
- 💭 Fine-tune a Covid-19 Doctor-like chatbot with GPT2 ☆51 Updated 4 years ago
- ☆110 Updated 3 months ago
- ☆60 Updated 4 years ago
- Multi-Turn Chatbot with GPT-Neo and SageMaker: A conversational AI system for engaging and informative interactions with users. ☆7 Updated 2 years ago
- ☆40 Updated 2 years ago
- Scripts for fine-tuning Llama2 via SFT and DPO. ☆200 Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF) ☆173 Updated 2 years ago
- We finetune Bloomz-7b1-mt using LoRA with the chatdoctor-200k dataset; see https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt an… ☆30 Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆76 Updated 6 months ago
- Fine-Tuning LLM and embedding models ☆27 Updated last year
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on… ☆141 Updated last year
- Tensorflow, Pytorch, Huggingface Transformer, Fastai, etc. tutorial Colab Notebooks. ☆75 Updated 2 years ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs… ☆185 Updated last year
- Benchmarking various Deep Learning models such as BERT, ALBERT, BiLSTMs on the task of sentence entailment using two datasets - MultiNLI … ☆28 Updated 4 years ago
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines … ☆141 Updated last year
- A Multilingual Replicable Instruction-Following Model ☆93 Updated last year
- ☆99 Updated 3 years ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation ☆198 Updated last year
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands. ☆55 Updated last year
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI ☆501 Updated 3 months ago
- FRAKE: Fusional Real-time Automatic Keyword Extraction ☆21 Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions ☆171 Updated last year
- This repository contains the code to train flan t5 with alpaca instructions and low rank adaptation. ☆51 Updated last year
- This project collects awesome resources (e.g., papers, open-source models) for large language model (LLM) ☆249 Updated last year
- Pretraining a GPT from scratch with your own custom domain data and Amazon SageMaker ☆7 Updated 2 years ago