affjljoo3581 / GPT2Links
PyTorch Implementation of OpenAI GPT-2
☆338Updated last year
Alternatives and similar repositories for GPT2
Users that are interested in GPT2 are comparing it to the libraries listed below
Sorting:
- Fine tune a T5 transformer model using PyTorch & Transformers🤗☆217Updated 4 years ago
- The PyTorch implementation of fine-tuning the GPT-2(Generative Pre-trained Transformer 2) for dialogue generation.☆176Updated last year
- The universal integrated corpus-building environment.☆31Updated 5 years ago
- Plain pytorch implementation of LLaMA☆188Updated 2 years ago
- Code for the ALiBi method for transformer language models (ICLR 2022)☆539Updated last year
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆473Updated last year
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-…☆563Updated last year
- Tutorial for how to build BERT from scratch☆98Updated last year
- A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.☆936Updated 2 years ago
- Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation☆1,005Updated 6 years ago
- ☆348Updated 4 years ago
- Prefix-Tuning: Optimizing Continuous Prompts for Generation☆944Updated last year
- Scripts for fine-tuning Llama2 via SFT and DPO.☆203Updated 2 years ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆172Updated 2 years ago
- A minimum example of aligning language models with RLHF similar to ChatGPT☆221Updated last year
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA☆225Updated last week
- Dialogue State Tracking (DST) Papers, Datasets, Resources 🤩☆190Updated 2 years ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆219Updated last year
- Automatically split your PyTorch models on multiple GPUs for training & inference☆658Updated last year
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆791Updated 2 years ago
- Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.☆472Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆824Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,411Updated last year
- Fine-tuning GPT-2 Small for Question Answering☆130Updated 2 years ago
- Scalable PaLM implementation of PyTorch☆190Updated 2 years ago
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆224Updated last year
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆167Updated 3 years ago
- Large-scale language modeling tutorials with PyTorch☆291Updated 3 years ago
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆270Updated 2 years ago
- Implementation of the first paper on word2vec☆236Updated 3 years ago