Stonesjtu / pytorch-learning
Learning notes from reading the PyTorch source code
☆24Updated 6 years ago
Alternatives and similar repositories for pytorch-learning
Users who are interested in pytorch-learning compare it to the libraries listed below
- Efficient, check-pointed data loading for deep learning with massive data sets.☆211Updated 2 years ago
- Implementation of a Transformer, but completely in Triton☆279Updated 3 years ago
- Torch Distributed Experimental☆117Updated last year
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆164Updated 3 weeks ago
- Distributed ML Optimizer☆35Updated 4 years ago
- ☆252Updated last year
- ☆124Updated last year
- Profile the GPU memory usage of every line of PyTorch code☆83Updated 7 years ago
- Research and development for optimizing transformers☆131Updated 4 years ago
- PyTorch RFCs (experimental)☆138Updated 8 months ago
- Fast Discounted Cumulative Sums in PyTorch☆97Updated 4 years ago
- ☆150Updated 2 years ago
- Simple implementation of Speculative Sampling in NumPy for GPT-2.☆99Updated 2 years ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆79Updated last year
- some common Huggingface transformers in maximal update parametrization (µP)☆87Updated 3 years ago
- ☆192Updated last week
- Training neural networks in TensorFlow 2.0 with 5x less memory☆137Updated 3 years ago
- ☆115Updated last year
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆69Updated last year
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.☆219Updated last week
- Scalable PaLM implementation in PyTorch☆190Updated 3 years ago
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.☆75Updated last year
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training!☆112Updated 2 years ago
- Train very large language models in Jax.☆210Updated 2 years ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆36Updated last year
- ☆125Updated last year
- Python pdb for multiple processes☆80Updated 8 months ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated 2 years ago
- Implementation of a Tensorflow XLA rematerialization pass☆15Updated 6 years ago
- PyTorch library for factorized L0-based pruning.☆45Updated 2 years ago