arunprsh / ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
A Practical Guide to Developing a Reliable FAQ Chatbot with Reinforcement Learning and Human Feedback using GPT-2 on AWS
☆12 · Updated last year
Related projects
Alternatives and complementary repositories for ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
- ☆46 · Updated last year
- Financial Domain Question Answering with pre-trained BERT Language Model ☆122 · Updated last year
- A Streamlit app running GPT-2 language model for text classification, built with Pytorch, Transformers and AWS SageMaker. ☆39 · Updated 2 years ago
- This repository contains the code to train flan t5 with alpaca instructions and low rank adaptation. ☆47 · Updated last year
- Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER) ☆78 · Updated 6 months ago
- 💭 Fine-tune a Covid-19 Doctor-like chatbot with GPT2 ☆49 · Updated 3 years ago
- A simple implementation showing how to use a language model for prompt-based learning ☆44 · Updated 2 years ago
- Tensorflow, Pytorch, Huggingface Transformer, Fastai, etc. tutorial Colab Notebooks. ☆71 · Updated last year
- The official repository for paper "It is AI’s Turn to Ask Humans a Question: Question-Answer Pair Generation for Children’s Story Books" … ☆30 · Updated last year
- ☆95 · Updated 3 years ago
- ☆39 · Updated last year
- ☆19 · Updated 5 months ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗 ☆198 · Updated 3 years ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands. ☆54 · Updated 6 months ago
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human… ☆56 · Updated last year
- ☆60 · Updated 3 years ago
- ☆107 · Updated 6 months ago
- Finetune BLOOM ☆40 · Updated last year
- ☆68 · Updated last year
- TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition (ACL 2021) ☆47 · Updated 2 years ago
- Efficient Attention for Long Sequence Processing ☆89 · Updated 11 months ago
- Transformer, T5, and RoBERTa encoder-decoder models for product name generation ☆48 · Updated 3 years ago
- Fine-tuning GPT-2 Small for Question Answering ☆130 · Updated last year
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers ☆50 · Updated last year
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines … ☆133 · Updated last year
- The jiant toolkit for general-purpose text understanding models ☆21 · Updated 4 years ago
- Reinforcement learning training for GPT-2, LLaMA, BLOOM, and other LLMs ☆26 · Updated last year
- Data and code for EMNLP 2022 paper "ConvFinQA: Exploring the Chain of Numerical Reasoning in Conversational Finance Question Answering" ☆83 · Updated 2 years ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF) ☆169 · Updated last year
- Multi-Turn Chatbot with GPT-Neo and SageMaker: A conversational AI system for engaging and informative interactions with users. ☆7 · Updated last year