arunprsh / ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
A Practical Guide to Developing a Reliable FAQ Chatbot with Reinforcement Learning and Human Feedback using GPT-2 on AWS
☆14 Updated 2 years ago
Alternatives and similar repositories for ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
Users who are interested in ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO are comparing it to the libraries listed below:
- 💭 Fine-tune a Covid-19 Doctor-like chatbot with GPT2 ☆52 Updated 4 years ago
- ☆45 Updated 2 years ago
- What can I do with a LLM model? ☆156 Updated 8 months ago
- Financial Domain Question Answering with pre-trained BERT Language Model ☆131 Updated 5 months ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs… ☆184 Updated 2 years ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF) ☆173 Updated 2 years ago
- Repo for fine-tuning Causal LLMs ☆458 Updated last year
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human… ☆60 Updated 2 years ago
- Scripts for fine-tuning Llama2 via SFT and DPO. ☆206 Updated 2 years ago
- Finetune BLOOM ☆40 Updated 2 years ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human… ☆219 Updated last year
- A Streamlit app running GPT-2 language model for text classification, built with Pytorch, Transformers and AWS SageMaker. ☆39 Updated 3 years ago
- ☆99 Updated 4 years ago
- ☆123 Updated 2 years ago
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters" ☆46 Updated 2 years ago
- ☆56 Updated 2 years ago
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-… ☆566 Updated last year
- Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT. ☆472 Updated last year
- Multilingual/multidomain question generation datasets, models, and python library for question generation. ☆368 Updated last year
- Fine tune a T5 transformer model using PyTorch & Transformers 🤗 ☆219 Updated 4 years ago
- ☆40 Updated 2 years ago
- ☆69 Updated 2 years ago
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data" ☆103 Updated last year
- Tensorflow, Pytorch, Huggingface Transformer, Fastai, etc. tutorial Colab Notebooks. ☆77 Updated 2 years ago
- Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER) ☆87 Updated last year
- Implementation of PersonaGPT Dialog Model ☆111 Updated 4 years ago
- ☆128 Updated 2 years ago
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI ☆517 Updated 10 months ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands. ☆57 Updated last year
- Define Transformers, T5 model and RoBERTa Encoder decoder model for product names generation ☆48 Updated 4 years ago