Andras7 / gpt2-pytorch
Extremely simple and understandable GPT2 implementation with minor tweaks
☆20 · Updated 5 years ago
Alternatives and similar repositories for gpt2-pytorch:
Users who are interested in gpt2-pytorch are comparing it to the repositories listed below.
- TP-N2F model ☆13 · Updated 4 years ago
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs. ☆47 · Updated 5 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch ☆45 · Updated 3 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆15 · Updated 3 years ago
- Python package for graph statistics ☆9 · Updated 4 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer ☆39 · Updated 4 years ago
- Automatically modelling and distilling knowledge within AI. In other words, summarising the AI research firehose. ☆21 · Updated 5 years ago
- hierarchical convolutional attention networks for text classification ☆16 · Updated 5 years ago
- Code for running the experiments in Deep Subjecthood: Higher Order Grammatical Features in Multilingual BERT ☆16 · Updated last year
- ☆26 · Updated 5 years ago
- Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings? (AAAI 2021) ☆9 · Updated 3 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation. ☆49 · Updated 3 years ago
- PyTorch code for the EMNLP 2020 paper "Embedding Words in Non-Vector Space with Unsupervised Graph Learning" ☆41 · Updated 4 years ago
- What are the best Systems? New Perspectives on NLP Benchmarking ☆13 · Updated last year
- Large Scale BERT Distillation ☆32 · Updated last year
- Code for the 2019 TACL Paper "Trick Me If You Can: Human-in-the-loop Generation of Adversarial Question Answering Examples" ☆34 · Updated 5 years ago
- ☆13 · Updated 6 years ago
- Training a model without a dataset for natural language inference (NLI) ☆25 · Updated 4 years ago
- Making a bridge between NLP models and Brain data ☆18 · Updated 4 years ago
- Bi-Directional Attention Flow for Machine Comprehensions ☆9 · Updated 7 years ago
- ☆15 · Updated 2 years ago
- A generic library for crafting adversarial NLP examples - WIP ☆40 · Updated 6 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆18 · Updated 2 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171. ☆15 · Updated 2 years ago
- ☆11 · Updated 2 years ago
- NUANCED is a user-centric conversational recommendation dataset that contains 5.1k annotated dialogues and 26k high-quality user turns. ☆18 · Updated 3 years ago
- Baseline models for the paper: "Modeling Naive Psychology of Characters in Simple Commonsense Stories" by Hannah Rashkin, Antoine Bosselu… ☆16 · Updated 3 years ago
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections ☆50 · Updated 3 years ago
- Tensorflow port implementation of Single Headed Attention RNN ☆16 · Updated 5 years ago
- Unifew: Unified Fewshot Learning Model ☆18 · Updated 3 years ago