soskek / bert-chainer
Chainer implementation of "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
☆224Updated 5 years ago
Alternatives and similar repositories for bert-chainer:
Users that are interested in bert-chainer are comparing it to the libraries listed below
- The Tensorflow code for this ACL 2018 paper: "Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mech…☆286Updated 2 years ago
- Graph Convolution Network for NLP☆212Updated last year
- Dilated CNNs for NER in TensorFlow☆242Updated 6 years ago
- ICLR 2018 Quick-Thought vectors☆204Updated 5 years ago
- On the Dimensionality of Word Embedding☆328Updated 4 years ago
- A PyTorch implementation of QANet.☆344Updated 3 years ago
- XLNet: fine tuning on RTX 2080 GPU - 8 GB☆154Updated 5 years ago
- Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems☆352Updated last year
- Neural network toolkit for sentence pair modeling.☆303Updated 4 years ago
- Code for the paper: Sentence-State LSTM for Text Representation☆160Updated 6 years ago
- Code of Directional Self-Attention Network (DiSAN)☆312Updated 7 years ago
- ☆322Updated 2 years ago
- add BERT to encoder part for https://github.com/memray/seq2seq-keyphrase-pytorch☆79Updated 6 years ago
- multi-gpu pre-training in one machine for BERT without horovod (Data Parallelism)☆172Updated last month
- ☆323Updated 5 years ago
- R-net in PyTorch, with ELMo☆198Updated 5 years ago
- Global-Locally Self-Attentive Dialogue State Tracker☆185Updated 3 years ago
- Contextual augmentation, a text data augmentation using a bidirectional language model.☆192Updated 5 years ago
- Enhanced LTSM for natural language inference☆264Updated 5 years ago
- ☆113Updated 7 years ago
- ☆263Updated 2 years ago
- Feel free to fine tune large BERT models with Multi-GPU and FP16 support.☆192Updated 5 years ago
- The baselines used in the CoQA paper☆176Updated 5 years ago
- Simple Tensorflow Implementation of "A Structured Self-attentive Sentence Embedding" (ICLR 2017)☆91Updated 6 years ago
- Dynamic Coattention Network Plus (DCN+) TensorFlow implementation. Question answering using Deep NLP.☆120Updated 6 years ago
- incorporating copying mechanism in sequence-to-sequence learning☆178Updated 7 years ago
- Tensorflow implementation of "A Structured Self-Attentive Sentence Embedding"☆193Updated 3 years ago
- Cleaned code for paper "Natural Language Inference over Interaction Space"☆248Updated 2 years ago
- Some frequently used NLP blocks I implemented☆226Updated 6 years ago
- BERT as language model, fork from https://github.com/google-research/bert☆247Updated last year