affjljoo3581 / GPT2Links
PyTorch Implementation of OpenAI GPT-2
☆344Updated last year
Alternatives and similar repositories for GPT2
Users that are interested in GPT2 are comparing it to the libraries listed below
Sorting:
- Code for the ALiBi method for transformer language models (ICLR 2022)☆543Updated last year
- The universal integrated corpus-building environment.☆31Updated 5 years ago
- Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation☆1,008Updated 6 years ago
- Plain pytorch implementation of LLaMA☆188Updated 2 years ago
- The PyTorch implementation of fine-tuning the GPT-2(Generative Pre-trained Transformer 2) for dialogue generation.☆176Updated last year
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆473Updated last year
- Scalable PaLM implementation of PyTorch☆188Updated 2 years ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗☆218Updated 4 years ago
- Fine-tuning GPT-2 Small for Question Answering☆130Updated 2 years ago
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA☆228Updated last month
- ☆560Updated 2 years ago
- A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.☆936Updated 3 years ago
- Prefix-Tuning: Optimizing Continuous Prompts for Generation☆946Updated last year
- Pytorch implementation of GPT-1☆26Updated 3 years ago
- A minimum example of aligning language models with RLHF similar to ChatGPT☆221Updated 2 years ago
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-…☆564Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆173Updated 2 years ago
- Tutorial for how to build BERT from scratch☆99Updated last year
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆224Updated last year
- A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models☆813Updated 2 years ago
- Transformer implementation in PyTorch.☆491Updated 6 years ago
- A research project for natural language generation, containing the official implementations by MSRA NLC team.☆736Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,422Updated last year
- Scripts for fine-tuning Llama2 via SFT and DPO.☆203Updated 2 years ago
- This PyTorch package implements MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation (NAACL 2022).☆112Updated 3 years ago
- (Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from …☆180Updated last year
- An implementation of masked language modeling for Pytorch, made as concise and simple as possible☆179Updated 2 years ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆232Updated 6 months ago
- LLaMA 2 implemented from scratch in PyTorch☆353Updated 2 years ago
- Dialogue State Tracking (DST) Papers, Datasets, Resources 🤩☆192Updated 2 years ago