yifding / hetseqLinks

HetSeq: Distributed GPU Training on Heterogeneous Infrastructure

☆106

Alternatives and similar repositories for hetseq

Users that are interested in hetseq are comparing it to the libraries listed below

Sorting:

ofirpress / shortformer
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
☆147Updated 4 years ago
srush / awesome-ml-tracking
☆104Updated 4 years ago
LaihoE / did-it-spill
Check if you have training samples in your test set
☆64Updated 3 years ago
lf1-io / padl
Functional deep learning
☆108Updated 2 years ago
facebookresearch / dynalab
The Python library with command line tools to interact with Dynabench(https://dynabench.org/), such as uploading models.
☆55Updated 3 years ago
HomebrewML / HomebrewNLP-torch
A case study of efficient training of large language models using commodity hardware.
☆68Updated 3 years ago
EleutherAI / pyfra
Python Research Framework
☆106Updated 3 years ago
srush / parallax
☆153Updated 5 years ago
minitorch / minitorch.github.io
Docs
☆143Updated 11 months ago
gsarti / lambda-bert
A 🤗-style implementation of BERT using lambda layers instead of self-attention
☆69Updated 5 years ago
facebookresearch / transformer-sequential
Trains Transformer model variants. Data isn't shuffled between batches.
☆143Updated 3 years ago
r0mainK / outperformer
Code for scaling Transformers
☆26Updated 4 years ago
prajjwal1 / fluence
A deep learning library based on Pytorch focussed on low resource language research and robustness
☆70Updated 3 years ago
allenai / tpu_pretrain
LM Pretraining with PyTorch/TPU
☆136Updated 6 years ago
awaelchli / pytorch-lightning-snippets
A collection of code snippets for my PyTorch Lightning projects
☆107Updated 4 years ago
lucidrains / g-mlp-gpt
GPT, but made only out of MLPs
☆89Updated 4 years ago
asappresearch / flambe
An ML framework to accelerate research and its path to production.
☆267Updated last year
lucidrains / charformer-pytorch
Implementation of the GBST block from the Charformer paper, in Pytorch
☆119Updated 4 years ago
HendrikStrobelt / LMdiff
A diff tool for language models
☆44Updated last year
AndreasMadsen / python-textualheatmap
Create interactive textual heat maps for Jupiter notebooks
☆196Updated last year
yandex-research / DeDLOC
Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)
☆118Updated 3 years ago
lucidrains / feedback-transformer-pytorch
Implementation of Feedback Transformer in Pytorch
☆108Updated 4 years ago
louislva / deepmind-perceiver
My implementation of DeepMind's Perceiver
☆63Updated 4 years ago
lucidrains / electra-pytorch
A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch
☆235Updated 2 years ago
aiwabdn / pygln
Python implementation of GLN in different frameworks
☆97Updated 5 years ago
SirRob1997 / Crowded-Valley---Results
This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"
☆184Updated 4 years ago
r2llab / wrangl
Parallel data preprocessing for NLP and ML.
☆34Updated last year
iKernels / transformers-lightning
A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…
☆47Updated 2 years ago
shawwn / tpunicorn
Babysit your preemptible TPUs
☆86Updated 2 years ago
cybertronai / transformer-xl
Training Transformer-XL on 128 GPUs
☆141Updated 5 years ago