aistairc / kirt_bert_on_abci
Training BERT on ABCI
☆19Updated 3 years ago
Alternatives and similar repositories for kirt_bert_on_abci
Users who are interested in kirt_bert_on_abci compare it to the libraries listed below.
- 🚀 A demonstration of hyperparameter optimization using Optuna for models implemented with AllenNLP.☆16Updated 4 years ago
- ☆34Updated 5 years ago
- Python package for understanding the difficulty of text classification datasets. (in CoNLL 2018)☆63Updated 4 years ago
- NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks☆72Updated 2 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Updated 6 years ago
- PyTorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.☆89Updated last year
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆136Updated last year
- Codes to pre-train Japanese T5 models☆41Updated 3 years ago
- A PyTorch-based Neural Machine Translation Framework for Research☆26Updated 4 years ago
- An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.☆103Updated 4 years ago
- Robsut Wrod Reocginiton via semi-Character Recurrent Neural Network☆21Updated 7 years ago
- ⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy☆33Updated 3 years ago
- Implementation of unsupervised smoothed inverse frequency (Best Paper, Repl4NLP @ ACL 2018)☆77Updated 6 years ago
- Sequence to Sequence Models in PyTorch☆44Updated 11 months ago
- 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.☆68Updated 5 years ago
- DeepThought's solution☆80Updated last year
- The Business Scene Dialogue corpus☆68Updated 3 years ago
- A simple implementation of SimCSE☆77Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Updated 3 years ago
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Updated 4 years ago
- Pre-training of Language Models for Language Understanding☆83Updated 5 years ago
- Chainer implementation of "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"☆224Updated 5 years ago
- Scripts to train a bidirectional LSTM with knowledge distillation from BERT☆158Updated 5 years ago
- Python implementation of A La Carte Embedding☆9Updated 6 years ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆75Updated 6 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Updated 5 years ago
- PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"☆96Updated last year
- Assorted tools and utility functions, mainly for doing NLP with Python☆24Updated 5 months ago
- MT Tutorial for the JSALT 2019 Summer School☆48Updated 6 years ago