arakhmati / torchtrail
torchtrail: trace the graph of torch functions and modules for visualization, reports, etc
☆25 Updated 7 months ago
Alternatives and similar repositories for torchtrail:
Users who are interested in torchtrail are comparing it to the libraries listed below.
- TT-NN operator library, and TT-Metalium low level kernel programming model. ☆601 Updated this week
- SGEMM that beats cuBLAS ☆68 Updated last week
- extensible collectives library in triton ☆77 Updated 4 months ago
- LLM training in simple, raw C/CUDA ☆91 Updated 8 months ago
- Experiment of using Tangent to autodiff triton ☆74 Updated last year
- ☆171 Updated last week
- ☆15 Updated 4 months ago
- seqax = sequence modeling + JAX ☆136 Updated 6 months ago
- Attention in SRAM on Tenstorrent Grayskull ☆31 Updated 6 months ago
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems ☆99 Updated last week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆240 Updated this week
- ☆21 Updated 3 months ago
- A safetensors extension to efficiently store sparse quantized tensors on disk ☆66 Updated this week
- Tenstorrent MLIR compiler ☆86 Updated this week
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry ☆40 Updated last year
- Explore training for quantized models ☆13 Updated 3 weeks ago
- ☆85 Updated 11 months ago
- ⭐️ TTNN Compiler for PyTorch 2.0 ⭐️ It enables running PyTorch2.0 models on Tenstorrent hardware ☆29 Updated this week
- Make triton easier ☆44 Updated 7 months ago
- Fast Matrix Multiplications for Lookup Table-Quantized LLMs ☆221 Updated last week
- PyTorch centric eager mode debugger ☆44 Updated last month
- ☆49 Updated 5 months ago
- Fastest kernels written from scratch ☆131 Updated 2 months ago
- ☆64 Updated 2 months ago
- Boosting 4-bit inference kernels with 2:4 Sparsity ☆64 Updated 4 months ago
- FlexAttention w/ FlashAttention3 Support ☆27 Updated 3 months ago
- IREE's PyTorch Frontend, based on Torch Dynamo. ☆62 Updated this week
- ☆48 Updated 10 months ago
- ☆21 Updated this week
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks. ☆58 Updated last week