PyTorch centric eager mode debugger
☆48Dec 16, 2024Updated last year
Alternatives and similar repositories for torchdbg
Users that are interested in torchdbg are comparing it to the libraries listed below
Sorting:
- TORCH_TRACE parser for PT2☆78Feb 26, 2026Updated last week
- ☆21Mar 3, 2025Updated last year
- Hacks for PyTorch☆19Apr 18, 2023Updated 2 years ago
- Reinforcement learning modular with pytorch☆11Jan 18, 2021Updated 5 years ago
- Read and write tensorboard data using Rust☆24Feb 4, 2024Updated 2 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- Display tensors directly from GPU☆11Oct 12, 2025Updated 4 months ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆12Jun 19, 2017Updated 8 years ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 9 months ago
- ☆16Feb 24, 2026Updated last week
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Aug 29, 2023Updated 2 years ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Updated this week
- Blindspots in LLMs I've noticed while AI coding. Sonnet family emphasis.☆13Mar 20, 2025Updated 11 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 4 months ago
- Experimental LLM interface exploring new ways to use AI to improve human thinking☆19Feb 27, 2026Updated last week
- Minimal Implimentation of VCRec (2024) for collapse provention.☆18Jan 28, 2025Updated last year
- One File Tensor Libraries☆30Oct 7, 2025Updated 5 months ago
- Visualising Losses in Deep Neural Networks☆16Jul 17, 2024Updated last year
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆21Sep 24, 2025Updated 5 months ago
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications☆40Nov 11, 2024Updated last year
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆26Oct 13, 2025Updated 4 months ago
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆16Sep 15, 2023Updated 2 years ago
- Write your code as tree-like expressions, then transform it☆21Jan 9, 2024Updated 2 years ago
- ☆19Dec 4, 2025Updated 3 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- An experimental implementation of compiler-driven automatic sharding of models across a given device mesh.☆52Updated this week
- Graph Transformers for Large Graphs☆22Apr 26, 2024Updated last year
- ☆52Jun 10, 2024Updated last year
- An implementation of the Llama architecture, to instruct and delight☆21May 31, 2025Updated 9 months ago
- Scalable and Stable Parallelization of Nonlinear RNNS☆29Oct 21, 2025Updated 4 months ago
- ☆20Sep 22, 2023Updated 2 years ago
- Utilities for Training Very Large Models☆58Sep 25, 2024Updated last year
- A library to analyze PyTorch traces.☆472Feb 4, 2026Updated last month
- ☆23Jun 18, 2024Updated last year
- ☆20Jun 10, 2024Updated last year
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆36Dec 2, 2025Updated 3 months ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- Memory Efficient Training Framework for Large Video Generation Model☆25Apr 22, 2024Updated last year
- Canvas: End-to-End Kernel Architecture Search in Neural Networks☆27Nov 18, 2024Updated last year