PrincetonUniversity / multi_gpu_trainingLinks

☆345

Alternatives and similar repositories for multi_gpu_training

Users that are interested in multi_gpu_training are comparing it to the libraries listed below

Sorting:

srush / annotated-mamba
Annotated version of the Mamba paper
☆486Updated last year
pytorch-labs / attention-gym
Helpful tools and examples for working with flex-attention
☆876Updated last week
elyall / wandb_on_slurm
Example of how to use Weights & Biases on Slurm
☆115Updated 2 years ago
HazyResearch / aisys-building-blocks
Building blocks for foundation models.
☆516Updated last year
lucidrains / st-moe-pytorch
Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch
☆350Updated last year
CSCfi / pytorch-ddp-examples
☆52Updated last year
klieret / wandb-offline-sync-hook
A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet!
☆82Updated last week
facebookresearch / FFCV-SSL
FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.
☆208Updated last year
lucidrains / memory-efficient-attention-pytorch
Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"
☆378Updated 2 years ago
facebookresearch / optimizers
For optimization algorithm research and development.
☆521Updated this week
fferflo / einx
Universal Tensor Operations in Einstein-Inspired Notation for Python.
☆385Updated 3 months ago
pytorch / tensordict
TensorDict is a pytorch dedicated tensor container.
☆942Updated this week
kvfrans / jax-diffusion-transformer
Implementation of Diffusion Transformer (DiT) in JAX
☆279Updated last year
lucidrains / rotary-embedding-torch
Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch
☆708Updated last week
lucidrains / ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
☆529Updated 2 months ago
google-research / jaxpruner
☆230Updated 5 months ago
tintn / vision-transformer-from-scratch
A Simplified PyTorch Implementation of Vision Transformer (ViT)
☆193Updated last year
srush / annotated-s4
Implementation of https://srush.github.io/annotated-s4
☆499Updated 3 weeks ago
AvivBick / awesome-ssm-ml
Reading list for research topics in state-space models
☆306Updated last month
mlcommons / algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…
☆388Updated last week
PeaBrane / mamba-tiny
Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).
☆120Updated 9 months ago
facebookincubator / submitit
Python 3.8+ toolbox for submitting jobs to Slurm
☆1,478Updated last month
BlackHC / toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
☆438Updated 10 months ago
bobby-he / simplified_transformers
☆292Updated 7 months ago
lucidrains / soft-moe-pytorch
Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch
☆304Updated 3 months ago
pytorch / torcheval
A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…
☆235Updated 6 months ago
MinghuiChen43 / awesome-deep-phenomena
A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...
☆332Updated last week
minyoungg / platonic-rep
☆573Updated 3 months ago
kach / gradient-descent-the-ultimate-optimizer
Code for our NeurIPS 2022 paper
☆369Updated 2 years ago
nikhilvyas / SOAP
☆197Updated 7 months ago