aistairc / kirt_bert_on_abciLinks

Training BERT on ABCI

☆19

Alternatives and similar repositories for kirt_bert_on_abci

Users that are interested in kirt_bert_on_abci are comparing it to the libraries listed below

Sorting:

himkt / optuna-allennlp
🚀 A demonstration of hyperparameter optimization using Optuna for models implemented with AllenNLP.
☆16Updated 4 years ago
cfiken / paper-reading
☆34Updated 5 years ago
Wluper / edm
Python package for understanding the difficulty of text classification datasets. (in CoNNL 2018)
☆63Updated 4 years ago
mhagiwara / nanigonet
NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks
☆72Updated 2 years ago
AkariAsai / extractive_rc_by_runtime_mt
Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"
☆40Updated 6 years ago
octanove / shiba
Pytorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.
☆89Updated last year
uds-lsv / bert-stable-fine-tuning
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
☆136Updated last year
megagonlabs / t5-japanese
Codes to pre-train Japanese T5 models
☆41Updated 3 years ago
zomux / nmtlab
A Pytorch-based Neural Machine Translation Framework for Research
☆26Updated 4 years ago
MorinoseiMorizo / jparacrawl-finetune
An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.
☆103Updated 4 years ago
keisks / robsut-wrod-reocginiton
Robsut Wrod Reocginiton via semi-Character Recurrent Neural Network
☆21Updated 7 years ago
himkt / allennlp-optuna
⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy
☆33Updated 3 years ago
kawine / usif
Implementation of unsupervised smoothed inverse frequency (Best Paper, Repl4NLP @ ACL 2018)
☆77Updated 6 years ago
threelittlemonkeys / seq2seq-pytorch
Sequence to Sequence Models in PyTorch
☆44Updated 11 months ago
hsajjad / transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
☆68Updated 5 years ago
see-- / natural-question-answering
DeepThought's solution
☆80Updated last year
tsuruoka-lab / BSD
The Business Scene Dialogue corpus
☆68Updated 3 years ago
hppRC / simple-simcse
A simple implementation of SimCSE
☆77Updated 2 years ago
ofirpress / shortformer
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
☆147Updated 3 years ago
himkt / allennlp-NER
☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)
☆15Updated 4 years ago
lyeoni / pretraining-for-language-understanding
Pre-training of Language Models for Language Understanding
☆83Updated 5 years ago
soskek / bert-chainer
Chainer implementation of "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
☆224Updated 5 years ago
tacchinotacchi / distil-bilstm
Scripts to train a bidirectional LSTM with knowledge distillation from BERT
☆158Updated 5 years ago
yagays / alacarte_embedding
Python implementation of A La Carte Embedding
☆9Updated 6 years ago
allenai / tpu_pretrain
LM Pretraining with PyTorch/TPU
☆134Updated 5 years ago
titu1994 / keras-LAMB-Optimizer
Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"
☆75Updated 6 years ago
carolinlawrence / BiSon
Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.
☆51Updated 5 years ago
alexandra-chron / siatl
PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"
☆96Updated last year
bheinzerling / dougu
Assorted tools and utility functions, mainly for doing NLP with Python
☆24Updated 5 months ago
pmichel31415 / jsalt-2019-mt-tutorial
MT Tutorial for the JSALT 2019 Summer School
☆48Updated 6 years ago