spfrommer / torchexplorerLinks
Interactively inspect module inputs, outputs, parameters, and gradients.
☆338Updated 3 weeks ago
Alternatives and similar repositories for torchexplorer
Users that are interested in torchexplorer are comparing it to the libraries listed below
Sorting:
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793☆417Updated 3 weeks ago
- Annotated version of the Mamba paper☆482Updated last year
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆423Updated 5 months ago
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.☆588Updated 3 months ago
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores☆319Updated 5 months ago
- Helpful tools and examples for working with flex-attention☆811Updated last week
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆546Updated this week
- Muon: An optimizer for hidden layers in neural networks☆678Updated last week
- The AdEMAMix Optimizer: Better, Faster, Older.☆183Updated 8 months ago
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch☆337Updated 11 months ago
- For optimization algorithm research and development.☆518Updated this week
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆233Updated 4 months ago
- Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"☆379Updated last year
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆239Updated 3 months ago
- torchview: visualize pytorch models☆940Updated 2 weeks ago
- TensorDict is a pytorch dedicated tensor container.☆927Updated this week
- Implementation of the proposed minGRU in Pytorch☆296Updated 2 months ago
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch☆514Updated 3 weeks ago
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI☆282Updated 2 months ago
- When it comes to optimizers, it's always better to be safe than sorry☆237Updated 2 months ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…☆116Updated 3 months ago
- ☆286Updated last month
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆554Updated 11 months ago
- Reading list for research topics in state-space models☆292Updated last week
- An extension of the nanoGPT repository for training small MOE models.☆147Updated 2 months ago
- optimizer & lr scheduler & loss function collections in PyTorch☆297Updated last week
- Implementation of https://srush.github.io/annotated-s4☆497Updated 2 years ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆120Updated 10 months ago
- Scalable and Performant Data Loading☆269Updated last week
- Thunder gives you PyTorch models superpowers for training and inference. Unlock out-of-the-box optimizations for performance, memory and …☆1,357Updated this week