soskek / bert-chainer
Chainer implementation of "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
☆220Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for bert-chainer
- The Tensorflow code for this ACL 2018 paper: "Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mech…☆284Updated last year
- Graph Convolution Network for NLP☆213Updated last year
- Re-implementation of ELMo on Keras☆135Updated last year
- XLNet: fine tuning on RTX 2080 GPU - 8 GB☆154Updated 5 years ago
- A PyTorch implementation of QANet.☆345Updated 2 years ago
- ICLR 2018 Quick-Thought vectors☆205Updated 5 years ago
- On the Dimensionality of Word Embedding☆329Updated 4 years ago
- Dilated CNNs for NER in TensorFlow☆243Updated 5 years ago
- Materials from the ACL 2018 tutorial on neural semantic parsing☆403Updated 6 years ago
- ☆262Updated 2 years ago
- ☆322Updated 5 years ago
- BertQA - Attention on Steroids☆115Updated 2 years ago
- Global-Locally Self-Attentive Dialogue State Tracker☆186Updated 2 years ago
- Easy to use NLP library built on PyTorch and TorchText☆254Updated 4 years ago
- Some frequently used NLP blocks I implemented☆227Updated 5 years ago
- multi-gpu pre-training in one machine for BERT from scratch without horovod (Data Parallelism)☆173Updated 3 weeks ago
- Which Encoding is the Best for Text Classification in Chinese, English, Japanese and Korean?☆174Updated 6 years ago
- ☆300Updated 6 years ago
- The baselines used in the CoQA paper☆176Updated 4 years ago
- Multi-class metrics for Tensorflow☆225Updated 2 years ago
- ☆50Updated 4 years ago
- ALBERT model Pretraining and Fine Tuning using TF2.0☆200Updated last year
- ACL 2018: Hybrid semi-Markov CRF for Neural Sequence Labeling (http://aclweb.org/anthology/P18-2038)☆305Updated 6 years ago
- ☆114Updated 6 years ago
- XLNet for generating language.☆165Updated 3 years ago
- Code for the paper: Sentence-State LSTM for Text Representation☆158Updated 6 years ago
- BERT as language model, fork from https://github.com/google-research/bert☆247Updated 8 months ago
- Code for Adversarial Training Methods for Semi-Supervised Text Classification☆123Updated 6 years ago
- PositionRank: An Unsupervised Approach to Keyphrase Extraction from Scholarly Documents☆96Updated last year