spfrommer / torchexplorer
Interactively inspect module inputs, outputs, parameters, and gradients.
☆323Updated 2 months ago
Alternatives and similar repositories for torchexplorer:
Users that are interested in torchexplorer are comparing it to the libraries listed below
- Annotated version of the Mamba paper☆474Updated last year
- torchview: visualize pytorch models☆882Updated this week
- Helpful tools and examples for working with flex-attention☆679Updated this week
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆520Updated 3 weeks ago
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores☆304Updated 2 months ago
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆228Updated last month
- TensorDict is a pytorch dedicated tensor container.☆891Updated this week
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch☆506Updated 4 months ago
- For optimization algorithm research and development.☆498Updated 2 weeks ago
- ☆148Updated last year
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆417Updated 3 months ago
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.☆554Updated last week
- Transform datasets at scale. Optimize datasets for fast AI model training.☆421Updated this week
- Muon optimizer: +>30% sample efficiency with <3% wallclock overhead☆494Updated this week
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch☆316Updated 8 months ago
- When it comes to optimizers, it's always better to be safe than sorry☆213Updated 2 weeks ago
- Build high-performance AI models with modular building blocks☆480Updated last week
- ☆768Updated last month
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793☆388Updated 3 months ago
- depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.☆600Updated 3 months ago
- Cataloging released Triton kernels.☆185Updated 2 months ago
- ☆287Updated 2 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆178Updated 6 months ago
- A pytorch quantization backend for optimum☆897Updated last week
- optimizer & lr scheduler & loss function collections in PyTorch☆279Updated this week
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆370Updated this week
- Scalable and Performant Data Loading☆224Updated this week
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI☆274Updated 4 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆267Updated 9 months ago