PrincetonUniversity / multi_gpu_training
☆258 Updated 6 months ago
Related projects:
- Building blocks for foundation models. ☆347 Updated 8 months ago
- Example of how to use Weights & Biases on Slurm ☆108 Updated 2 years ago
- Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory ☆419 Updated 3 weeks ago
- FFCV-SSL: Fast Forward Computer Vision for Self-Supervised Learning. ☆199 Updated last year
- TensorDict is a PyTorch-dedicated tensor container. ☆808 Updated this week
- Annotated version of the Mamba paper ☆445 Updated 6 months ago
- Helpful tools and examples for working with flex-attention ☆341 Updated last month
- Python 3.8+ toolbox for submitting jobs to Slurm ☆1,255 Updated this week
- Universal Tensor Operations in Einstein-Inspired Notation for Python. ☆316 Updated last month
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆493 Updated this week
- Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in PyTorch ☆278 Updated 3 months ago
- Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory" ☆355 Updated last year
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement… ☆321 Updated 2 weeks ago
- Code for our NeurIPS 2022 paper ☆360 Updated last year
- TorchOpt is an efficient library for differentiable optimization built upon PyTorch. ☆528 Updated this week
- Implementation of https://srush.github.io/annotated-s4 ☆457 Updated last year
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores ☆265 Updated 7 months ago
- Implementation of Rotary Embeddings, from the Roformer paper, in PyTorch ☆528 Updated last week
- A curated list of papers with interesting empirical studies and insights on deep learning. Continually updated. ☆242 Updated 3 weeks ago
- Load tensorboard event logs as pandas DataFrames for scientific plotting; supports both PyTorch and TensorFlow ☆173 Updated last month
- Named tensors with first-class dimensions for PyTorch ☆321 Updated last year
- Train ImageNet *fast* in 500 lines of code with FFCV ☆135 Updated 4 months ago
- CLU lets you write beautiful training loops in JAX. ☆319 Updated 3 weeks ago
- BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient. ☆555 Updated 4 months ago
- For optimization algorithm research and development. ☆240 Updated last week
- Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization ☆330 Updated 2 months ago
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac… ☆213 Updated this week