arunprsh / ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPOLinks
A Practical Guide to Developing a Reliable FAQ Chatbot with Reinforcement Learning and Human Feedback using GPT-2 on AWS
☆14Updated 2 years ago
Alternatives and similar repositories for ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
Users that are interested in ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO are comparing it to the libraries listed below
Sorting:
- 💭 Fine-tune a Covid-19 Doctor-like chatbot with GPT2☆51Updated 4 years ago
- ☆44Updated 2 years ago
- Financial Domain Question Answering with pre-trained BERT Language Model☆126Updated this week
- A Streamlit app running GPT-2 language model for text classification, built with Pytorch, Transformers and AWS SageMaker.☆39Updated 3 years ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆171Updated 2 years ago
- ☆99Updated 4 years ago
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆58Updated 2 years ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗☆216Updated 4 years ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆218Updated last year
- Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.☆471Updated last year
- Scripts for fine-tuning Llama2 via SFT and DPO.☆200Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆185Updated 2 years ago
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-…☆560Updated last year
- What can I do with a LLM model?☆157Updated 3 months ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆55Updated last year
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines …☆144Updated last year
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness☆144Updated 11 months ago
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆51Updated 2 years ago
- User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.☆338Updated 2 years ago
- ☆122Updated 2 years ago
- TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition (ACL 2021)☆52Updated 2 years ago
- Fine-tuning GPT-2 Small for Question Answering☆130Updated 2 years ago
- ☆69Updated 2 years ago
- Open efforts to implement ChatGPT-like models and beyond.☆108Updated 11 months ago
- This repository is the official implementation of our paper MVP: Multi-task Supervised Pre-training for Natural Language Generation.☆72Updated 2 years ago
- Prompt Fine-tuning on GLM, BART and Flan-T5.☆21Updated 2 years ago
- This project collects awesome resources (e.g., papers, open-source models) for large language model (LLM)☆252Updated last year
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters"☆46Updated 2 years ago
- Implementation of PersonaGPT Dialog Model☆111Updated 3 years ago
- Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI m…☆222Updated last year