allenai/tpu_pretrain

LM Pretraining with PyTorch/TPU

☆137

Alternatives and similar repositories for tpu_pretrain

Users who are interested in tpu_pretrain are comparing it to the repositories listed below.

facebookresearch / SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359 · Feb 22, 2022 · Updated 4 years ago
facebookresearch / adaptive-span
Transformer training code for sequential tasks
☆610 · Sep 14, 2021 · Updated 4 years ago
artyompal / tpu_models
A fork of the official TPU models repo with fixes and a solution to the Kaggle Open Images 2019 Object Detection Challenge
☆49 · Oct 15, 2019 · Updated 6 years ago
BinWang28 / Sentence-Embedding-S3E
Efficient Sentence Embedding via Semantic Subspace Analysis
☆14 · Feb 25, 2020 · Updated 6 years ago
pytorch / xla
Enabling PyTorch on XLA Devices (e.g. Google TPU)
☆2,795 · May 27, 2026 · Updated 2 months ago
ModuNLP / weekly-meeting
Meetings every Thursday at 20:00
☆16 · Jul 24, 2020 · Updated 6 years ago
harvardnlp / cascaded-generation
Cascaded Text Generation with Markov Transformers
☆130 · Mar 20, 2023 · Updated 3 years ago
kakaobrain / kortok
The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)
☆119 · Oct 8, 2020 · Updated 5 years ago
seujung / t5-summarization
☆25 · Oct 28, 2020 · Updated 5 years ago
alexa / bort
Repository for the paper "Optimal Subarchitecture Extraction for BERT"
☆470 · Jun 22, 2022 · Updated 4 years ago
allenai / allennlp-reading-comprehension-research
☆41 · Feb 12, 2019 · Updated 7 years ago
allenai / vampire
Variational Methods for Pretraining in Resource-limited Environments
☆174 · Jul 29, 2020 · Updated 6 years ago
clovaai / length-adaptive-transformer
Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)
☆102 · Nov 2, 2020 · Updated 5 years ago
pytorch-tpu / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆22 · Jan 25, 2023 · Updated 3 years ago
laiguokun / Funnel-Transformer
☆220 · Jun 8, 2020 · Updated 6 years ago
j-min / WikiExtractor_To_the_one_text
Simple extension of WikiExtractor (https://github.com/attardi/wikiextractor)
☆16 · Dec 23, 2016 · Updated 9 years ago
BinWang28 / SBERT-WK-Sentence-Embedding
IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models
☆181 · Jan 28, 2021 · Updated 5 years ago
facebookresearch / QA-Overlap
Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"
☆66 · Aug 31, 2021 · Updated 4 years ago
astariul / gibbs
Scale your ML workers asynchronously across processes and machines
☆13 · Apr 1, 2025 · Updated last year
idiap / inv-tn
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
☆21 · Sep 27, 2017 · Updated 8 years ago
noowad93 / chosung-translator
Chosung (Korean initial-consonant) interpreter based on ko-BART
☆29 · Mar 31, 2021 · Updated 5 years ago
pytorch-tpu / examples
This repository contains example code to build models on TPUs
☆30 · Feb 17, 2023 · Updated 3 years ago
astariul / encode-attend-navigate-pytorch
Encode-attend-navigate unofficial Pytorch implementation
☆12 · Oct 1, 2024 · Updated last year
noiseQA / NoiseQA
☆12 · Feb 22, 2021 · Updated 5 years ago
ModuNLP / hacking_transformers
☆11 · Aug 12, 2020 · Updated 5 years ago
monologg / kakaotrans
[Unofficial] Kakaotrans: Kakao translate API for python
☆16 · Mar 29, 2020 · Updated 6 years ago
JovianHQ / fastai_slack
Get Slack notifications while training FastAI models
☆13 · May 20, 2019 · Updated 7 years ago
SKTBrain / KVQA
Korean Visual Question Answering
☆59 · Feb 18, 2020 · Updated 6 years ago
Huffon / nlp-startups
A list of Korean startups that research and develop natural language processing technology
☆163 · May 10, 2020 · Updated 6 years ago
catalyst-team / bert
A barebones (Distil)BERT pipeline for token classification tasks driven by catalyst
☆13 · Oct 14, 2019 · Updated 6 years ago
monologg / transformers-android-demo
📲 Transformers android examples (Tensorflow Lite & Pytorch Mobile)
☆83 · Jun 12, 2023 · Updated 3 years ago
allenai / allentune
Hyperparameter Search for AllenNLP
☆141 · Mar 6, 2025 · Updated last year
microsoft / fastformers
FastFormers - highly efficient transformer models for NLU
☆706 · Mar 21, 2025 · Updated last year
huggingface / naacl_transfer_learning_tutorial
Repository of code for the tutorial on Transfer Learning in NLP held at NAACL 2019 in Minneapolis, MN, USA
☆724 · Oct 16, 2019 · Updated 6 years ago
IBM / PoWER-BERT
Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…
☆63 · Sep 17, 2025 · Updated 10 months ago
ofirpress / shortformer
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
☆147 · Jul 26, 2021 · Updated 5 years ago
shmsw25 / bart-closed-book-qa
A BART version of an open-domain QA model in a closed-book setup
☆118 · Aug 13, 2020 · Updated 5 years ago
Kaleidophon / token2index
A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …
☆50 · Dec 6, 2024 · Updated last year
icml-2020-nlp / semsim
☆40 · Jun 2, 2021 · Updated 5 years ago