affjljoo3581 / GPT2
PyTorch Implementation of OpenAI GPT-2
☆317Updated 8 months ago
Alternatives and similar repositories for GPT2:
Users that are interested in GPT2 are comparing it to the libraries listed below
- PyTorch Implementation of OpenAI GPT☆122Updated last year
- Prefix-Tuning: Optimizing Continuous Prompts for Generation☆913Updated 10 months ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗☆209Updated 4 years ago
- A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.☆927Updated 2 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆471Updated last year
- Code for "Learning to summarize from human feedback"☆1,014Updated last year
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,296Updated last year
- The universal integrated corpus-building environment.☆30Updated 4 years ago
- Code for the ALiBi method for transformer language models (ICLR 2022)☆516Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,371Updated 11 months ago
- ☆344Updated 3 years ago
- Diffusion-LM☆1,107Updated 7 months ago
- ☆96Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆171Updated last year
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-…☆552Updated 10 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆193Updated 5 months ago
- [ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models☆760Updated last year
- A minimum example of aligning language models with RLHF similar to ChatGPT☆217Updated last year
- Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation☆987Updated 5 years ago
- Scalable PaLM implementation of PyTorch☆192Updated 2 years ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆164Updated 3 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆786Updated last year
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA☆205Updated last year
- [NIPS2023] RRHF & Wombat☆803Updated last year
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆215Updated last year
- Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets☆313Updated last year
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,141Updated last year
- A modular RL library to fine-tune language models to human preferences☆2,287Updated last year
- The PyTorch implementation of fine-tuning the GPT-2(Generative Pre-trained Transformer 2) for dialogue generation.☆174Updated 9 months ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆259Updated last year