sachinhosmani / torchvistaLinks
Interactive Pytorch forward pass visualization in notebooks
☆659Updated last week
Alternatives and similar repositories for torchvista
Users that are interested in torchvista are comparing it to the libraries listed below
Sorting:
- Code release for DynamicTanh (DyT)☆1,027Updated 8 months ago
- Interactively inspect module inputs, outputs, parameters, and gradients.☆354Updated 7 months ago
- This repository provides a Python script to fetch and summarize research papers from arXiv using the free Gemini API☆252Updated 9 months ago
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"☆572Updated 2 months ago
- Model Activity Visualiser☆520Updated 8 months ago
- Learning Deep Representations of Data Distributions☆677Updated this week
- Implementation of Stable Diffusion with PyTorch☆360Updated 9 months ago
- A minimal PyTorch re-implementation of Qwen3 VL with a fancy CLI☆277Updated last week
- A fully functional and simple Machine Learning library made entirely from scratch with Python.☆324Updated last month
- Labs for MIT 6.S184/6.S975, IAP 2025☆268Updated last week
- From scratch implementation of a vision language model in pure PyTorch☆252Updated last year
- Static suckless single batch CUDA-only qwen3-0.6B mini inference engine☆530Updated 3 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆180Updated 4 months ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw☆574Updated last year
- Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper☆789Updated 4 months ago
- Minimal and annotated implementations of key ideas from modern deep learning research.☆1,205Updated 2 months ago
- When it comes to optimizers, it's always better to be safe than sorry☆396Updated 2 months ago
- A straightforward method for training your LLM, from downloading data to generating text.☆487Updated 4 months ago
- Muon is an optimizer for hidden layers in neural networks☆2,098Updated 3 weeks ago
- A curated list of materials on AI efficiency☆195Updated last month
- Contains the public resources of Hands on GenAI book☆219Updated 11 months ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆423Updated 9 months ago
- A collection of tricks and tools to speed up transformer models☆192Updated last month
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆250Updated 7 months ago
- Building DeepSeek R1 from Scratch☆727Updated 8 months ago
- CUDA tutorials for Maths & ML tutorials with examples, covers multi-gpus, fused attention, winograd convolution, reinforcement learning.☆205Updated 6 months ago
- A command-line interface tool for serving LLM using vLLM.☆456Updated last week
- [NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)☆433Updated last month
- Interactive visualizations of the geometric intuition behind diffusion models.☆906Updated 5 months ago
- Simple and readable code for training and sampling from diffusion models☆666Updated 6 months ago