arunprsh / ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPOLinks
A Practical Guide to Developing a Reliable FAQ Chatbot with Reinforcement Learning and Human Feedback using GPT-2 on AWS
β14Updated 2 years ago
Alternatives and similar repositories for ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
Users that are interested in ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO are comparing it to the libraries listed below
Sorting:
- π Fine-tune a Covid-19 Doctor-like chatbot with GPT2β51Updated 4 years ago
- β44Updated last year
- Financial Domain Question Answering with pre-trained BERT Language Modelβ126Updated last month
- Implementation of Reinforcement Learning from Human Feedback (RLHF)β172Updated 2 years ago
- β40Updated 2 years ago
- β100Updated 3 years ago
- Prompt Fine-tuning on GLM, BART and Flan-T5.β21Updated 2 years ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigsβ¦β185Updated last year
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Humanβ¦β58Updated 2 years ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Humanβ¦β215Updated last year
- β112Updated 2 weeks ago
- A Streamlit app running GPT-2 language model for text classification, built with Pytorch, Transformers and AWS SageMaker.β39Updated 3 years ago
- DIET Classifier mini implementation on pytorch.β46Updated 4 months ago
- DST(Dialogue State Tracker) for LLM(Large Language Model)β23Updated last year
- β12Updated last year
- use chatGLM to perform text embeddingβ45Updated 2 years ago
- β23Updated 4 years ago
- A question-answering dataset with a focus on subjective informationβ45Updated last year
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines β¦β142Updated last year
- fine tuning mistral 7B using Huggingface, Weights and Biases, Choline, and Vast AIβ38Updated last year
- Fine-Tuning LLM and embedding modelsβ27Updated last year
- Source code to reproduce results of our paper "DIET: Lightweight Language Understanding for Dialogue Systems"β62Updated 5 years ago
- Finetune BLOOMβ40Updated 2 years ago
- GLM (General Language Model)β24Updated 3 years ago
- Implementation of PersonaGPT Dialog Modelβ111Updated 3 years ago
- β60Updated 4 years ago
- We use bert model to fine-tune dialogue task.β22Updated 6 years ago
- A minimum example of aligning language models with RLHF similar to ChatGPTβ219Updated last year
- A unified versatile interface for dialogue datasetsβ17Updated last year
- Fine-tuning GPT-2 Small for Question Answeringβ130Updated 2 years ago