sachinhosmani / torchvistaLinks
Interactive Pytorch forward pass visualization in notebooks
☆667Updated this week
Alternatives and similar repositories for torchvista
Users that are interested in torchvista are comparing it to the libraries listed below
Sorting:
- This repository provides a Python script to fetch and summarize research papers from arXiv using the free Gemini API☆253Updated 10 months ago
- Code release for DynamicTanh (DyT)☆1,031Updated 9 months ago
- Interactively inspect module inputs, outputs, parameters, and gradients.☆354Updated 2 weeks ago
- A minimal PyTorch re-implementation of Qwen3 VL with a fancy CLI☆297Updated last month
- A comprehensive book on neural networks and large language models in NLP☆533Updated last month
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"☆763Updated 3 months ago
- Model Activity Visualiser☆520Updated 9 months ago
- A straightforward method for training your LLM, from downloading data to generating text.☆503Updated 5 months ago
- Muon is an optimizer for hidden layers in neural networks☆2,179Updated last month
- [NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers☆3,029Updated 2 weeks ago
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)☆529Updated 3 months ago
- About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]☆832Updated 6 months ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw☆582Updated last year
- Static suckless single batch CUDA-only qwen3-0.6B mini inference engine☆538Updated 4 months ago
- Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper☆791Updated 4 months ago
- LeetGPU Challenges☆574Updated this week
- [NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)☆443Updated 3 weeks ago
- Minimal and annotated implementations of key ideas from modern deep learning research.☆1,217Updated 3 months ago
- Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.☆2,245Updated this week
- CUDA tutorials for Maths & ML tutorials with examples, covers multi-gpus, fused attention, winograd convolution, reinforcement learning.☆206Updated 6 months ago
- Implementation of Stable Diffusion with PyTorch☆360Updated 10 months ago
- Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in C…☆399Updated 2 weeks ago
- Speed Always Wins: A Survey on Efficient Architectures for Large Language Models☆380Updated last month
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆181Updated 5 months ago
- From scratch implementation of a vision language model in pure PyTorch☆252Updated last year
- A collection of tricks and tools to speed up transformer models☆194Updated 3 weeks ago
- A curated list of materials on AI efficiency☆202Updated 3 weeks ago
- Building DeepSeek R1 from Scratch☆735Updated 9 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,494Updated 2 months ago
- Textbook on reinforcement learning from human feedback☆1,382Updated last week