sachinhosmani / torchvista
Interactive PyTorch forward pass visualization in notebooks
☆607 · Updated this week
Alternatives and similar repositories for torchvista
Users who are interested in torchvista are comparing it to the libraries listed below.
- This repository provides a Python script to fetch and summarize research papers from arXiv using the free Gemini API ☆250 · Updated 8 months ago
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face" ☆560 · Updated last month
- Code release for DynamicTanh (DyT) ☆1,021 · Updated 7 months ago
- Interactively inspect module inputs, outputs, parameters, and gradients. ☆354 · Updated 6 months ago
- A comprehensive book on neural networks and large language models in NLP ☆423 · Updated last month
- Learning Deep Representations of Data Distributions ☆614 · Updated this week
- Contains the public resources of Hands on GenAI book ☆214 · Updated 10 months ago
- Model Activity Visualiser ☆519 · Updated 7 months ago
- A fully functional and simple Machine Learning library made entirely from scratch with Python. ☆321 · Updated last week
- Implementation of Stable Diffusion with PyTorch ☆359 · Updated 9 months ago
- Minimal and annotated implementations of key ideas from modern deep learning research. ☆1,204 · Updated last month
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting… ☆179 · Updated 4 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo] ☆812 · Updated 5 months ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw ☆570 · Updated 11 months ago
- Generate a comprehensive review from an arXiv paper, then turn it into a blog post. This project powers the website below for the Hugging… ☆806 · Updated 9 months ago
- Muon is an optimizer for hidden layers in neural networks ☆2,028 · Updated 4 months ago
- Static suckless single batch CUDA-only qwen3-0.6B mini inference engine ☆512 · Updated 2 months ago
- Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper ☆780 · Updated 3 months ago
- Textbook on reinforcement learning from human feedback ☆1,329 · Updated this week
- From scratch implementation of a vision language model in pure PyTorch ☆250 · Updated last year
- When it comes to optimizers, it's always better to be safe than sorry ☆379 · Updated last month
- Fetch arxiv data to LLM-friendly text ☆126 · Updated 8 months ago
- Train a Language Model with GRPO to create a schedule from a list of events and priorities ☆247 · Updated 6 months ago
- A Deep Research agent from scratch ☆212 · Updated 6 months ago
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,401 · Updated last month
- A minimal, easy-to-read PyTorch reimplementation of the Qwen3 and Qwen2.5 VL with a fancy CLI ☆191 · Updated last week
- A straightforward method for training your LLM, from downloading data to generating text. ☆474 · Updated 3 months ago
- Automatic Video Generation from Scientific Papers ☆1,551 · Updated last month
- All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation. ☆1,109 · Updated this week
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025] ☆242 · Updated 4 months ago