LM Pretraining with PyTorch/TPU
☆137Oct 24, 2019Updated 6 years ago
Alternatives and similar repositories for tpu_pretrain
Users that are interested in tpu_pretrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Transformer training code for sequential tasks☆609Sep 14, 2021Updated 4 years ago
- A fork of the official TPU models repo with fixes and a solution of the Kaggle Open Images 2019 Object Detection Challenge☆49Oct 15, 2019Updated 6 years ago
- Efficient Sentence Embedding via Semantic Subspace Analysis☆14Feb 25, 2020Updated 6 years ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,763Dec 18, 2025Updated 4 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 매주 목요일, 20:00 모임☆16Jul 24, 2020Updated 5 years ago
- Cascaded Text Generation with Markov Transformers☆130Mar 20, 2023Updated 3 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆119Oct 8, 2020Updated 5 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆470Jun 22, 2022Updated 3 years ago
- ☆41Feb 12, 2019Updated 7 years ago
- Variational Methods for Pretraining in Resource-limited Environments☆174Jul 29, 2020Updated 5 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Nov 2, 2020Updated 5 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆22Jan 25, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆220Jun 8, 2020Updated 5 years ago
- Simple extension of WikiExtractor(https://github.com/attardi/wikiextractor)☆16Dec 23, 2016Updated 9 years ago
- IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models☆182Jan 28, 2021Updated 5 years ago
- Scale your ML workers asynchronously across processes and machines☆13Apr 1, 2025Updated last year
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"☆65Aug 31, 2021Updated 4 years ago
- ☆12Feb 22, 2021Updated 5 years ago
- 초성 해석기 based on ko-BART☆29Mar 31, 2021Updated 5 years ago
- This repository contains example code to build models on TPUs☆30Feb 17, 2023Updated 3 years ago
- Encode-attend-navigate unofficial Pytorch implementation☆12Oct 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- ☆11Aug 12, 2020Updated 5 years ago
- [Unofficial] Kakaotrans: Kakao translate API for python☆16Mar 29, 2020Updated 6 years ago
- Korean Visual Question Answering☆59Feb 18, 2020Updated 6 years ago
- Get Slack notifications while training FastAI models☆13May 20, 2019Updated 6 years ago
- Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…☆62Sep 17, 2025Updated 7 months ago
- Hyperparameter Search for AllenNLP☆140Mar 6, 2025Updated last year
- A barebones (Distil)BERT pipeline for token classification tasks driven by catalyst☆13Oct 14, 2019Updated 6 years ago
- FastFormers - highly efficient transformer models for NLU☆709Mar 21, 2025Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Repository of code for the tutorial on Transfer Learning in NLP held at NAACL 2019 in Minneapolis, MN, USA☆722Oct 16, 2019Updated 6 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- 국내 자연어 처리 기술을 연구 및 개발하는 스타트업 목록☆164May 10, 2020Updated 5 years ago
- A BART version of an open-domain QA model in a closed-book setup☆119Aug 13, 2020Updated 5 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆50Dec 6, 2024Updated last year
- 📲 Transformers android examples (Tensorflow Lite & Pytorch Mobile)☆83Jun 12, 2023Updated 2 years ago
- ☆40Jun 2, 2021Updated 4 years ago