spfrommer / torchexplorer
Interactively inspect module inputs, outputs, parameters, and gradients.
☆337Updated this week
Alternatives and similar repositories for torchexplorer
Users that are interested in torchexplorer are comparing it to the libraries listed below
Sorting:
- Helpful tools and examples for working with flex-attention☆766Updated last week
- torchview: visualize pytorch models☆928Updated 3 weeks ago
- Annotated version of the Mamba paper☆483Updated last year
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆536Updated this week
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆424Updated 5 months ago
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch☆331Updated 11 months ago
- A pytorch quantization backend for optimum☆935Updated 3 weeks ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…☆116Updated 2 months ago
- LoRA and DoRA from Scratch Implementations☆202Updated last year
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores☆317Updated 4 months ago
- For optimization algorithm research and development.☆513Updated this week
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.☆580Updated 2 months ago
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793☆412Updated this week
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆233Updated 2 months ago
- When it comes to optimizers, it's always better to be safe than sorry☆223Updated last month
- TensorDict is a pytorch dedicated tensor container.☆925Updated this week
- A easy, reliable, fluid template for python packages complete with docs, testing suites, readme's, github workflows, linting and much muc…☆173Updated last month
- Transform datasets at scale. Optimize datasets for fast AI model training.☆472Updated this week
- Muon optimizer: +>30% sample efficiency with <3% wallclock overhead☆623Updated last month
- Implementation of the proposed minGRU in Pytorch☆292Updated 2 months ago
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆369Updated last month
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆407Updated 10 months ago
- [ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters☆555Updated 3 months ago
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"☆550Updated 4 months ago
- Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"☆378Updated last year
- The AdEMAMix Optimizer: Better, Faster, Older.☆183Updated 8 months ago
- The official implementation of Tensor ProducT ATTenTion Transformer (T6)☆368Updated last month
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆553Updated 10 months ago
- An extension of the nanoGPT repository for training small MOE models.☆142Updated 2 months ago
- Thunder gives you PyTorch models superpowers for training and inference. Unlock out-of-the-box optimizations for performance, memory and …☆1,342Updated this week