arunprsh / ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
A Practical Guide to Developing a Reliable FAQ Chatbot with Reinforcement Learning and Human Feedback using GPT-2 on AWS
☆12Updated 2 years ago
Alternatives and similar repositories for ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO:
Users that are interested in ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO are comparing it to the libraries listed below
- Finetune BLOOM☆40Updated last year
- ☆40Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆185Updated last year
- Efficient Attention for Long Sequence Processing☆92Updated last year
- ☆46Updated last year
- 💭 Fine-tune a Covid-19 Doctor-like chatbot with GPT2☆51Updated 4 years ago
- A Streamlit app running GPT-2 language model for text classification, built with Pytorch, Transformers and AWS SageMaker.☆39Updated 3 years ago
- ☆67Updated 2 years ago
- Multi-Turn Chatbot with GPT-Neo and SageMaker: A conversational AI system for engaging and informative interactions with users.☆7Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆67Updated 3 months ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆55Updated 9 months ago
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆26Updated last year
- Scripts for fine-tuning Llama2 via SFT and DPO.☆194Updated last year
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines …☆136Updated last year
- A simple example for finetuning HuggingFace T5 model. Includes code for intermediate generation.☆27Updated 4 years ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗☆206Updated 4 years ago
- We use bert model to fine-tune dialogue task.☆22Updated 5 years ago
- Source code to reproduce results of our paper "DIET: Lightweight Language Understanding for Dialogue Systems"☆61Updated 4 years ago
- An Evaluation of ChatGPT on Information Extraction task, including Named Entity Recognition (NER), Relation Extraction (RE), Event Extrac…☆132Updated last year
- ☆121Updated last year
- The Few-Shot Bot: Prompt-Based Learning for Dialogue Systems☆119Updated 3 years ago
- DST(Dialogue State Tracker) for LLM(Large Language Model)☆22Updated last year
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆58Updated last year
- Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)☆80Updated 9 months ago
- DIET Classifier mini implementation on pytorch.☆45Updated 2 weeks ago
- ☆61Updated 4 years ago
- Patch for MPT-7B which allows using and training a LoRA☆58Updated last year
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆80Updated last year
- ☆99Updated 3 years ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆211Updated 9 months ago