affjljoo3581 / GPT2
PyTorch Implementation of OpenAI GPT-2
☆323Updated 8 months ago
Alternatives and similar repositories for GPT2:
Users that are interested in GPT2 are comparing it to the libraries listed below
- Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation☆989Updated 5 years ago
- The universal integrated corpus-building environment.☆30Updated 4 years ago
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,305Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,378Updated last year
- PyTorch Implementation of OpenAI GPT☆124Updated last year
- Code for the ALiBi method for transformer language models (ICLR 2022)☆519Updated last year
- Prefix-Tuning: Optimizing Continuous Prompts for Generation☆917Updated 11 months ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆164Updated 3 years ago
- A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.☆929Updated 2 years ago
- ☆345Updated 3 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆785Updated last year
- A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch☆224Updated last year
- A modular RL library to fine-tune language models to human preferences☆2,294Updated last year
- The PyTorch implementation of fine-tuning the GPT-2(Generative Pre-trained Transformer 2) for dialogue generation.☆175Updated 10 months ago
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-…☆555Updated 10 months ago
- An open collection of implementation tips, tricks and resources for training large language models☆471Updated 2 years ago
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723☆727Updated 2 years ago
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks☆260Updated 7 months ago
- This PyTorch package implements MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation (NAACL 2022).☆103Updated 2 years ago
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆859Updated last year
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆471Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆819Updated 2 years ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆172Updated last year
- [DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations☆791Updated 3 years ago
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA☆206Updated last year
- Fine tune a T5 transformer model using PyTorch & Transformers🤗☆209Updated 4 years ago
- Tutorial for how to build BERT from scratch☆90Updated 10 months ago
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch☆321Updated 9 months ago
- Diffusion-LM☆1,115Updated 7 months ago
- [ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models☆763Updated last year