Stonesjtu / pytorch-learning
Notes from studying the PyTorch source code
☆24 Updated 6 years ago
Alternatives and similar repositories for pytorch-learning
Users who are interested in pytorch-learning are comparing it to the repositories listed below
- Efficient, check-pointed data loading for deep learning with massive data sets. ☆208 Updated 2 years ago
- Implementation of a Transformer, but completely in Triton ☆273 Updated 3 years ago
- Fast Discounted Cumulative Sums in PyTorch ☆96 Updated 3 years ago
- ☆251 Updated last year
- Research and development for optimizing transformers ☆129 Updated 4 years ago
- Profile the GPU memory usage of every line in a Pytorch code ☆83 Updated 7 years ago
- Distributed ML Optimizer ☆32 Updated 4 years ago
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training! ☆112 Updated 2 years ago
- Torch Distributed Experimental ☆117 Updated last year
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… ☆158 Updated last month
- ☆66 Updated 4 months ago
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021) ☆117 Updated 3 years ago
- A case study of efficient training of large language models using commodity hardware. ☆68 Updated 3 years ago
- ☆114 Updated last year
- Python pdb for multiple processes ☆51 Updated 2 months ago
- Train very large language models in Jax. ☆206 Updated last year
- ☆187 Updated last week
- ☆361 Updated last year
- FairSeq repo with Apollo optimizer ☆114 Updated last year
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021) ☆101 Updated 4 years ago
- ☆147 Updated 2 years ago
- Transformers without Tears: Improving the Normalization of Self-Attention ☆132 Updated last year
- some common Huggingface transformers in maximal update parametrization (µP) ☆82 Updated 3 years ago
- PyTorch implementation of L2L execution algorithm ☆107 Updated 2 years ago
- This repository contains example code to build models on TPUs ☆30 Updated 2 years ago
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md ☆24 Updated 2 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module ☆83 Updated last year
- Training neural networks in TensorFlow 2.0 with 5x less memory ☆132 Updated 3 years ago
- Simple and efficient pytorch-native transformer training and inference (batched) ☆78 Updated last year
- Block-sparse primitives for PyTorch ☆157 Updated 4 years ago