sachinhosmani / torchvista
Interactive PyTorch forward pass visualization in notebooks
☆591 Updated last week
Alternatives and similar repositories for torchvista
Users who are interested in torchvista are comparing it to the libraries listed below.
- Code release for DynamicTanh (DyT) ☆1,016 Updated 6 months ago
- This repository provides a Python script to fetch and summarize research papers from arXiv using the free Gemini API ☆244 Updated 7 months ago
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face" ☆525 Updated last week
- Interactively inspect module inputs, outputs, parameters, and gradients. ☆352 Updated 4 months ago
- A comprehensive book on neural networks and large language models in NLP ☆360 Updated last week
- A straightforward method for training your LLM, from downloading data to generating text. ☆442 Updated 2 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo] ☆788 Updated 3 months ago
- Model Activity Visualiser ☆518 Updated 6 months ago
- Learning Deep Representations of Data Distributions ☆470 Updated last week
- Building DeepSeek R1 from Scratch ☆704 Updated 6 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting… ☆177 Updated 2 months ago
- A fully functional and simple Machine Learning library made entirely from scratch with Python. ☆298 Updated 2 months ago
- A minimal, easy-to-read PyTorch reimplementation of the Qwen3 and Qwen2.5 VL with a fancy CLI ☆167 Updated last month
- Minimal and annotated implementations of key ideas from modern deep learning research. ☆1,148 Updated 2 weeks ago
- Labs for MIT 6.S184/6.S975, IAP 2025 ☆227 Updated 5 months ago
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025) ☆468 Updated 2 weeks ago
- Machine Learning Q and AI book ☆663 Updated last month
- Static suckless single batch CUDA-only qwen3-0.6B mini inference engine ☆499 Updated last month
- All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation. ☆925 Updated this week
- Generate a comprehensive review from an arXiv paper, then turn it into a blog post. This project powers the website below for the Hugging… ☆793 Updated 7 months ago
- Muon is an optimizer for hidden layers in neural networks ☆1,827 Updated 3 months ago
- When it comes to optimizers, it's always better to be safe than sorry ☆375 Updated 2 weeks ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw ☆555 Updated 10 months ago
- [NeurIPS 2025] Open-source Multi-agent Poster Generation from Papers ☆2,668 Updated 2 weeks ago
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,347 Updated 2 months ago
- Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper ☆765 Updated last month
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast… ☆391 Updated 7 months ago
- Contains the public resources of Hands on GenAI book ☆196 Updated 9 months ago
- A collection of tricks and tools to speed up transformer models ☆182 Updated this week
- Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in C… ☆389 Updated 2 weeks ago