affjljoo3581 / GPT2
PyTorch Implementation of OpenAI GPT-2
☆290Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for GPT2
- Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation☆974Updated 5 years ago
- Prefix-Tuning: Optimizing Continuous Prompts for Generation☆896Updated 6 months ago
- A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.☆923Updated 2 years ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗☆197Updated 3 years ago
- Code for the ALiBi method for transformer language models (ICLR 2022)☆507Updated last year
- The PyTorch implementation of fine-tuning the GPT-2(Generative Pre-trained Transformer 2) for dialogue generation.☆173Updated 5 months ago
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723☆721Updated 2 years ago
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,232Updated last year
- [ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models☆734Updated 8 months ago
- ☆340Updated 3 years ago
- [NIPS2023] RRHF & Wombat☆798Updated last year
- Plain pytorch implementation of LLaMA☆189Updated last year
- ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Model…☆260Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,338Updated 8 months ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆466Updated 8 months ago
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆254Updated last year
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆778Updated last year
- PyTorch Implementation of OpenAI GPT☆119Updated last year
- Diffusion-LM☆1,058Updated 3 months ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆161Updated 3 years ago
- Fine-tuning GPT-2 Small for Question Answering☆130Updated last year
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆457Updated 2 years ago
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆207Updated last year
- Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.☆466Updated 8 months ago
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…☆432Updated last year
- User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.☆328Updated last year
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆851Updated last year
- Accompanying repo for the RLPrompt paper☆300Updated 5 months ago
- A research project for natural language generation, containing the official implementations by MSRA NLC team.☆692Updated 3 months ago
- Paper list for open-ended language generation☆188Updated 2 years ago