sachinhosmani / torchvista
Interactive PyTorch forward pass visualization in notebooks
☆514 Updated this week
Alternatives and similar repositories for torchvista
Users who are interested in torchvista are comparing it to the libraries listed below.
- This repository provides a Python script to fetch and summarize research papers from arXiv using the free Gemini API ☆236 Updated 5 months ago
- Code release for DynamicTanh (DyT) ☆994 Updated 4 months ago
- Interactively inspect module inputs, outputs, parameters, and gradients. ☆347 Updated 2 months ago
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face" ☆465 Updated 4 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting… ☆178 Updated 3 weeks ago
- Model Activity Visualiser ☆517 Updated 4 months ago
- Generate a comprehensive review from an arXiv paper, then turn it into a blog post. This project powers the website below for the Hugging… ☆785 Updated 5 months ago
- A fully functional and simple Machine Learning library made entirely from scratch with Python. ☆295 Updated last week
- Simple project page template for your research paper, built with Astro and Tailwind CSS ☆377 Updated 3 months ago
- Labs for MIT 6.S184/6.S975, IAP 2025 ☆185 Updated 3 months ago
- Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings) ☆278 Updated last month
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025] ☆223 Updated last month
- Fetch arxiv data to LLM-friendly text ☆124 Updated 5 months ago
- A Deep Research agent from scratch ☆201 Updated 2 months ago
- Curated resources for discovering, reading, and working with arXiv papers ☆324 Updated 2 months ago
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation ☆389 Updated last week
- A straightforward method for training your LLM, from downloading data to generating text. ☆414 Updated last week
- Implementation of Stable Diffusion with PyTorch ☆347 Updated 5 months ago
- Introduction to PyTorch, covering tensor initialization, operations, indexing, and reshaping. ☆430 Updated 2 weeks ago
- Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper ☆712 Updated 2 months ago
- Train a Language Model with GRPO to create a schedule from a list of events and priorities ☆219 Updated 3 months ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw ☆513 Updated 8 months ago
- The official implementation of TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425) ☆380 Updated 2 weeks ago
- When it comes to optimizers, it's always better to be safe than sorry ☆352 Updated this week
- From scratch implementation of a vision language model in pure PyTorch ☆234 Updated last year
- Textbook on reinforcement learning from human feedback ☆1,158 Updated this week
- A collection of tricks and tools to speed up transformer models ☆169 Updated 2 months ago
- ☆166 Updated last year
- Implementation of all RL algorithms in a simpler way ☆1,013 Updated 3 months ago
- Probability and Statistics for Data Science ☆485 Updated last week