pytorch-tpu / examplesLinks

This repository contains example code to build models on TPUs

☆30

Alternatives and similar repositories for examples

Users that are interested in examples are comparing it to the libraries listed below

Sorting:

HomebrewML / HomebrewNLP-torch
A case study of efficient training of large language models using commodity hardware.
☆68Updated 3 years ago
fastai / fastgpu
A queue service for quickly developing scripts that use all your GPUs efficiently
☆88Updated 3 years ago
iKernels / transformers-lightning
A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…
☆47Updated 2 years ago
lucidrains / g-mlp-gpt
GPT, but made only out of MLPs
☆89Updated 4 years ago
r0mainK / outperformer
Code for scaling Transformers
☆26Updated 5 years ago
Borda / pyDeprecate
Smoothly deprecate and redirect Python functions/classes with smart warnings and auto-routing—keep your codebase clean while maintaining …
☆51Updated this week
prajjwal1 / fluence
A deep learning library based on Pytorch focussed on low resource language research and robustness
☆70Updated 4 years ago
allenai / tpu_pretrain
LM Pretraining with PyTorch/TPU
☆136Updated 6 years ago
shawwn / tpunicorn
Babysit your preemptible TPUs
☆86Updated 3 years ago
huggingface / tune
☆87Updated 3 years ago
lucidrains / n-grammer-pytorch
Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch
☆76Updated 3 years ago
HomebrewML / Olmax
HomebrewNLP in JAX flavour for maintable TPU-Training
☆51Updated last year
fidelity / stoke
A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-…
☆67Updated 2 years ago
tmabraham / fastai_tpu
TPU support for the fastai library
☆13Updated 4 years ago
nikitakit / sabertooth
Standalone pre-training recipe with JAX+Flax
☆33Updated 2 years ago
EleutherAI / pyfra
Python Research Framework
☆106Updated 3 years ago
yifding / hetseq
HetSeq: Distributed GPU Training on Heterogeneous Infrastructure
☆106Updated 2 years ago
sayakpaul / count-tokens-hf-datasets
This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…
☆27Updated 3 years ago
ofirpress / shortformer
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
☆147Updated 4 years ago
MathInf / toroidal
a lightweight transformer library for PyTorch
☆72Updated 4 years ago
stas00 / porting
Helper scripts and notes that were used while porting various nlp models
☆48Updated 3 years ago
PetrochukM / HParams
Configure Python functions explicitly and safely
☆128Updated last year
yandex-research / DeDLOC
Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)
☆118Updated 3 years ago
srush / awesome-ml-tracking
☆104Updated 4 years ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆28Updated last year
krandiash / quinine
A library to create and manage configuration files, especially for machine learning projects.
☆79Updated 3 years ago
cgraywang / transformer-on-diet
Code repo for "Transformer on a Diet" paper
☆31Updated 5 years ago
lf1-io / padl
Functional deep learning
☆108Updated 3 years ago
google-research / precondition
☆31Updated 2 weeks ago
cceyda / lit-NER
TorchServe+Streamlit for easily serving your HuggingFace NER models
☆33Updated 3 years ago