TORCH_TRACE parser for PT2
☆78 · Mar 17, 2026 · Updated this week
Alternatives and similar repositories for tlparse
Users that are interested in tlparse are comparing it to the libraries listed below
- PyTorch centric eager mode debugger ☆48 · Dec 16, 2024 · Updated last year
- ☆21 · Mar 3, 2025 · Updated last year
- ICSE2021 Submission ☆13 · Aug 28, 2022 · Updated 3 years ago
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance. ☆330 · Mar 14, 2026 · Updated last week
- ☆42 · Dec 10, 2024 · Updated last year
- TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels ☆196 · Updated this week
- depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile. ☆797 · Oct 13, 2025 · Updated 5 months ago
- extensible collectives library in triton ☆97 · Mar 31, 2025 · Updated 11 months ago
- ☆20 · Sep 22, 2023 · Updated 2 years ago
- MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI tr… ☆71 · Updated this week
- [DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning ☆15 · Jan 13, 2024 · Updated 2 years ago
- Triton-based Symmetric Memory operators and examples ☆91 · Jan 15, 2026 · Updated 2 months ago
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo) ☆487 · Updated this week
- Artifacts of EVT ASPLOS'24 ☆30 · Mar 6, 2024 · Updated 2 years ago
- ☆163 · Dec 27, 2024 · Updated last year
- A library to analyze PyTorch traces. ☆474 · Updated this week
- PyTorch RFCs (experimental) ☆140 · May 26, 2025 · Updated 9 months ago
- Mirage Persistent Kernel: Compiling LLMs into a MegaKernel ☆2,159 · Updated this week
- Compiler for Dynamic Neural Networks ☆45 · Nov 13, 2023 · Updated 2 years ago
- Standalone commandline CLI tool for compiling Triton kernels ☆20 · Sep 13, 2024 · Updated last year
- PPX for template strings ☆14 · Nov 17, 2018 · Updated 7 years ago
- supporting pytorch FSDP for optimizers ☆84 · Dec 8, 2024 · Updated last year
- ☆11 · Dec 9, 2025 · Updated 3 months ago
- Backward compatible ML compute opset inspired by HLO/MHLO ☆628 · Updated this week
- FlagGems is an operator library for large language models implemented in the Triton Language. ☆917 · Updated this week
- Simple python library for generating your own perfetto traces for your application. Can be used for both app instrumentation and custom … ☆25 · Jun 22, 2025 · Updated 8 months ago
- ☆16 · Feb 24, 2026 · Updated 3 weeks ago
- An implementation of the Llama architecture, to instruct and delight ☆21 · May 31, 2025 · Updated 9 months ago
- Samples of good AI generated CUDA kernels ☆102 · May 30, 2025 · Updated 9 months ago
- Development containers for triton and triton-cpu ☆24 · Mar 9, 2026 · Updated last week
- Tensor Compute Primitives: Mid-level Intermediate Representation for Machine Learning Programs ☆35 · Jan 30, 2025 · Updated last year
- ☆191 · Jun 16, 2024 · Updated last year
- CUDA Template Functions ☆20 · Dec 16, 2025 · Updated 3 months ago
- ☆87 · Jan 23, 2025 · Updated last year
- Tile-based language built for AI computation across all scales ☆138 · Updated this week
- ☆12 · Aug 26, 2025 · Updated 6 months ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate. ☆803 · Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo. ☆103 · Dec 22, 2025 · Updated 2 months ago
- ☆139 · Aug 18, 2025 · Updated 7 months ago