Andras7 / gpt2-pytorch
Extremely simple and understandable GPT2 implementation with minor tweaks
☆21Updated 5 years ago
Alternatives and similar repositories for gpt2-pytorch:
Users that are interested in gpt2-pytorch are comparing it to the libraries listed below
- Similarity Encoder (SimEc) Neural Network Framework for learning low dimensional similarity preserving representations☆17Updated 4 years ago
- PyTorch implementation of GLOM☆21Updated 3 years ago
- Tensorflow port implementation of Single Headed Attention RNN☆16Updated 5 years ago
- ☆13Updated 6 years ago
- Python package for graph statistics☆9Updated 4 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆49Updated 3 years ago
- PyTorch code for the EMNLP 2020 paper "Embedding Words in Non-Vector Space with Unsupervised Graph Learning"☆41Updated 4 years ago
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.☆47Updated 5 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆14Updated 4 years ago
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆21Updated 2 years ago
- ☆22Updated 8 months ago
- The code repository associated with the NeurIPS 2020 paper: "Towards Neural Programming Interfaces"☆13Updated 2 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 2 years ago
- A simple semantic search engine for scientific papers.☆28Updated last year
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Updated 3 years ago
- ☆28Updated last year
- Large Scale BERT Distillation☆32Updated 2 years ago
- Code for "Rissanen Data Analysis: Examining Dataset Characteristics via Description Length" by Ethan Perez, Douwe Kiela, and Kyungyhun Ch…☆35Updated 3 years ago
- Code for pre-training CharacterBERT models (as well as BERT models).☆34Updated 3 years ago
- What are the best Systems? New Perspectives on NLP Benchmarking☆13Updated 2 years ago
- Introduction Notebook to Extreme Multi-Label Classification problem (XML)☆22Updated 6 years ago
- ☆24Updated 11 months ago
- LEMON: Explainable Entity Matching☆18Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆48Updated 3 years ago
- Repository for group 17 on the Statistical Natural Language Processing module at UCL☆22Updated 3 years ago
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections☆50Updated 3 years ago
- ☆21Updated 6 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 4 years ago
- Concealed Data Poisoning Attacks on NLP Models☆21Updated last year