ezyang / tlparse

TORCH_LOGS parser for PT2

☆30

Alternatives and similar repositories for tlparse:

Users who are interested in tlparse are comparing it to the libraries listed below.

pytorch-labs / tritonbench
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
☆75 Updated this week
pytorch-labs / triton-cpu
An experimental CPU backend for Triton (https://github.com/openai/triton)
☆37 Updated 8 months ago
yifuwang / symm-mem-recipes
☆27 Updated 3 weeks ago
cchan / tccl
Extensible collectives library in Triton
☆76 Updated 3 months ago
lianakoleva / no-libtorch-compile
☆21 Updated 2 months ago
triton-lang / kernels
☆64 Updated 2 months ago
microsoft / dist-ir
An IR for efficiently simulating distributed ML computation.
☆25 Updated last year
microsoft / DeepSpeed-Kernels
☆57 Updated 7 months ago
GindaChen / FlexFlashAttention3
FlexAttention w/ FlashAttention3 Support
☆27 Updated 3 months ago
mlc-ai / mlc-python
☆21 Updated last week
IBM / triton-dejavu
Framework to reduce autotune overhead to zero for well-known deployments.
☆57 Updated last month
LeiWang1999 / AutoGPTQ.tvm
GPTQ inference TVM kernel
☆38 Updated 8 months ago
neuralmagic / compressed-tensors
A safetensors extension to efficiently store sparse quantized tensors on disk
☆64 Updated this week
microsoft / TileFusion
☆36 Updated this week
nunoplopes / torchy
A tracing JIT compiler for PyTorch
☆12 Updated 3 years ago
ROCm / aotriton
Ahead of Time (AOT) Triton Math Library
☆50 Updated this week
wangsiping97 / FastGEMV
High-speed GEMV kernels, with up to 2.7x speedup compared to the PyTorch baseline.
☆93 Updated 6 months ago
mobiusml / gemlite
Fast low-bit matmul kernels in Triton
☆187 Updated last week
ColfaxResearch / cfx-article-src
☆66 Updated 3 weeks ago
iree-org / iree-nvgpu
☆48 Updated 10 months ago
stanford-futuredata / stk
☆96 Updated 4 months ago
microsoft / torchy
A tracing JIT for PyTorch
☆17 Updated 2 years ago
INT-FlashAttention2024 / INT-FlashAttention
☆56 Updated 3 months ago
NVIDIA / free-threaded-python
No-GIL Python environment featuring NVIDIA Deep Learning libraries.
☆39 Updated last month
makslevental / nelli
A lightweight, Pythonic frontend for MLIR
☆80 Updated last year
Jokeren / triton-samples
☆23 Updated this week
iree-org / iree-llvm-sandbox
A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM
☆56 Updated 4 months ago
ScalingIntelligence / KernelBench
KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems
☆91 Updated this week
MDK8888 / vllmini
A minimal implementation of vLLM.
☆32 Updated 5 months ago
octoml / synr
A library for syntactically rewriting Python programs, pronounced (sinner).
☆70 Updated 2 years ago