LM Pretraining with PyTorch/TPU
☆137Oct 24, 2019Updated 6 years ago
Alternatives and similar repositories for tpu_pretrain
Users that are interested in tpu_pretrain are comparing it to the libraries listed below
Sorting:
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆470Jun 22, 2022Updated 3 years ago
- Transformer training code for sequential tasks☆609Sep 14, 2021Updated 4 years ago
- IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models☆182Jan 28, 2021Updated 5 years ago
- This repository contains example code to build models on TPUs☆30Feb 17, 2023Updated 3 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆22Jan 25, 2023Updated 3 years ago
- Variational Methods for Pretraining in Resource-limited Environments☆174Jul 29, 2020Updated 5 years ago
- Get Slack notifications while training FastAI models☆13May 20, 2019Updated 6 years ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,755Dec 18, 2025Updated 2 months ago
- Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…☆62Sep 17, 2025Updated 5 months ago
- 매주 목요일, 20:00 모임☆16Jul 24, 2020Updated 5 years ago
- ☆221Jun 8, 2020Updated 5 years ago
- A fork of the official TPU models repo with fixes and a solution of the Kaggle Open Images 2019 Object Detection Challenge☆49Oct 15, 2019Updated 6 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Dec 6, 2024Updated last year
- ☆12Feb 22, 2021Updated 5 years ago
- ☆41Feb 12, 2019Updated 7 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- Cascaded Text Generation with Markov Transformers☆130Mar 20, 2023Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- FastFormers - highly efficient transformer models for NLU☆709Mar 21, 2025Updated 11 months ago
- ☆11Aug 12, 2020Updated 5 years ago
- Korean Visual Question Answering☆59Feb 18, 2020Updated 6 years ago
- [Unofficial] Kakaotrans: Kakao translate API for python☆16Mar 29, 2020Updated 5 years ago
- Neural Text Generation with Unlikelihood Training☆310Aug 31, 2021Updated 4 years ago
- A BART version of an open-domain QA model in a closed-book setup☆119Aug 13, 2020Updated 5 years ago
- Simple extension of WikiExtractor(https://github.com/attardi/wikiextractor)☆16Dec 23, 2016Updated 9 years ago
- 📲 Transformers android examples (Tensorflow Lite & Pytorch Mobile)☆83Jun 12, 2023Updated 2 years ago
- 국내 자연어 처리 기술을 연구 및 개발하는 스타트업 목록☆165May 10, 2020Updated 5 years ago
- Scale your ML workers asynchronously across processes and machines☆13Apr 1, 2025Updated 11 months ago
- 초성 해석기 based on ko-BART☆29Mar 31, 2021Updated 4 years ago
- NER task for Naver NLP Challenge 2018 (3rd Place)☆18Mar 24, 2023Updated 2 years ago
- ☆40Jun 2, 2021Updated 4 years ago
- Hyperparameter Search for AllenNLP☆140Mar 6, 2025Updated last year
- An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)☆451Feb 22, 2026Updated 2 weeks ago
- Repository of code for the tutorial on Transfer Learning in NLP held at NAACL 2019 in Minneapolis, MN, USA☆722Oct 16, 2019Updated 6 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆26Jun 2, 2021Updated 4 years ago
- Unsupervised text tokenizer focused on computational efficiency☆975Mar 29, 2024Updated last year
- Longformer: The Long-Document Transformer☆2,189Feb 8, 2023Updated 3 years ago