ezyang / torchdbg
PyTorch-centric eager mode debugger
☆43 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for torchdbg
- Experiment of using Tangent to autodiff triton ☆72 · Updated 9 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆35 · Updated 4 months ago
- TORCH_LOGS parser for PT2 ☆22 · Updated last week
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆98 · Updated 2 months ago
- Make triton easier ☆41 · Updated 5 months ago
- TorchFix - a linter for PyTorch-using code with autofix support ☆103 · Updated this week
- A place to store reusable transformer components of my own creation or found on the interwebs ☆44 · Updated 2 weeks ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings ☆43 · Updated last year
- extensible collectives library in triton ☆71 · Updated last month
- A library for unit scaling in PyTorch ☆105 · Updated 2 weeks ago
- An implementation of the Llama architecture, to instruct and delight ☆21 · Updated 3 months ago
- Torch Distributed Experimental ☆116 · Updated 3 months ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… ☆146 · Updated 2 weeks ago
- Pragmatic approach to parsing import profiles for CI's ☆11 · Updated 4 months ago
- Hacks for PyTorch ☆17 · Updated last year
- Utilities for Training Very Large Models ☆56 · Updated last month
- Context Manager to profile the forward and backward times of PyTorch's nn.Module ☆83 · Updated last year
- A case study of efficient training of large language models using commodity hardware. ☆68 · Updated 2 years ago
- Multidimensional indexing for tensors ☆113 · Updated last year
- Utilities for PyTorch distributed ☆23 · Updated last year
- This is a port of Mistral-7B model in JAX ☆30 · Updated 4 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI. ☆107 · Updated last year
- Scalable neural net training via automatic normalization in the modular norm. ☆121 · Updated 3 months ago
- ML/DL Math and Method notes ☆57 · Updated 11 months ago
- PyTorch video decoding ☆79 · Updated this week