aistairc / kirt_bert_on_abci
Training BERT on ABCI
☆19 Updated 4 years ago
Alternatives and similar repositories for kirt_bert_on_abci
Users who are interested in kirt_bert_on_abci are comparing it to the libraries listed below.
- A demonstration of hyperparameter optimization using Optuna for models implemented with AllenNLP. ☆16 Updated 5 years ago
- Python package for understanding the difficulty of text classification datasets. (in CoNLL 2018) ☆64 Updated 4 years ago
- Text classification with Sparse Composite Document Vectors. ☆61 Updated 5 years ago
- ☆34 Updated 5 years ago
- NanigoNet: Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks ☆71 Updated 2 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation" ☆40 Updated 7 years ago
- A Chainer implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI ☆28 Updated 7 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines ☆138 Updated 2 years ago
- PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models" ☆96 Updated 2 years ago
- Deliver the ready-to-train data to your NLP model. ☆122 Updated 3 years ago
- ParlAI Agent examples with PyTorch, Chainer and TensorFlow ☆46 Updated 8 years ago
- Python Implementation of EmbedRank ☆48 Updated 6 years ago
- ๆฌใชใใธใใชใฏใAllenNLPๅ ฅ้ใใฎใฝใผในใณใผใ็ฝฎใๅ ดใงใใโ35Updated 2 years ago
- LM Pretraining with PyTorch/TPU ☆137 Updated 6 years ago
- Python binding of primitiv. ☆17 Updated 3 years ago
- ☆47 Updated 6 years ago
- Implementation of unsupervised smoothed inverse frequency (Best Paper, Repl4NLP @ ACL 2018) ☆79 Updated 6 years ago
- Word embeddings that do not rely on word segmentation ☆14 Updated 8 years ago
- Assorted tools and utility functions, mainly for doing NLP with Python ☆23 Updated 4 months ago
- Pre-training of Language Models for Language Understanding ☆83 Updated 6 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models. ☆51 Updated 5 years ago
- Embedding Quantization (Compress Word Embeddings) ☆85 Updated 6 years ago
- PyTorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer. ☆89 Updated 2 years ago
- Code to pre-train Japanese T5 models ☆40 Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ☆147 Updated 4 years ago
- A PyTorch-based Neural Machine Translation Framework for Research ☆26 Updated 5 years ago
- Implementation of Neural Network based Named Entity Recognizer (Lample+, 2016) using Chainer. ☆45 Updated 3 years ago
- Chainer implementation of "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding" ☆225 Updated 6 years ago
- Decoding platform for machine translation research ☆54 Updated 6 years ago
- A PyTorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB) ☆53 Updated 3 years ago