spfrommer / torchexplorer
Interactively inspect module inputs, outputs, parameters, and gradients.
☆334 · Updated 4 months ago
Alternatives and similar repositories for torchexplorer:
Users who are interested in torchexplorer are comparing it to the libraries listed below.
- Annotated version of the Mamba paper ☆483 · Updated last year
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code. ☆576 · Updated last month
- Helpful tools and examples for working with flex-attention ☆726 · Updated 2 weeks ago
- torchview: visualize PyTorch models ☆908 · Updated this week
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate" ☆425 · Updated 4 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆534 · Updated this week
- For optimization algorithm research and development. ☆508 · Updated this week
- Transform datasets at scale. Optimize datasets for fast AI model training. ☆449 · Updated last week
- A PyTorch quantization backend for Optimum ☆922 · Updated last week
- TensorDict is a PyTorch-dedicated tensor container. ☆911 · Updated this week
- Muon optimizer: +>30% sample efficiency with <3% wallclock overhead ☆577 · Updated last month
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement… ☆375 · Updated last week
- depyf is a tool to help you understand and adapt to the PyTorch compiler torch.compile. ☆650 · Updated this week
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793 ☆407 · Updated last week
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores ☆317 · Updated 3 months ago
- Implementation of Diffusion Transformer (DiT) in JAX ☆272 · Updated 10 months ago
- Implementation of the proposed minGRU in PyTorch ☆286 · Updated last month
- Scalable and Performant Data Loading ☆237 · Updated last week
- The AdEMAMix Optimizer: Better, Faster, Older. ☆180 · Updated 7 months ago
- UNet diffusion model in pure CUDA ☆602 · Updated 9 months ago
- ☆133 · Updated last year
- From-scratch implementation of a vision language model in pure PyTorch ☆213 · Updated 11 months ago
- An easy, reliable, fluid template for Python packages complete with docs, testing suites, READMEs, GitHub workflows, linting and much muc… ☆170 · Updated 2 weeks ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte… ☆115 · Updated 2 months ago
- ☆290 · Updated 4 months ago
- Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory" ☆377 · Updated last year
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac… ☆229 · Updated 3 months ago
- Optimizer, LR scheduler, and loss function collections in PyTorch ☆289 · Updated this week
- ☆175 · Updated 4 months ago
- Pipeline Parallelism for PyTorch ☆764 · Updated 8 months ago