mvvsmk / OptMLLinks
Welcome to OptML! This repository is designed for those new to MLIR and machine learning-based optimizations. As a compiler enthusiast, I wanted to create a platform for hobbyists like myself to experiment with and benchmark new optimizations on real ML models in an out-of-tree manner.
☆20Updated last year
Alternatives and similar repositories for OptML
Users that are interested in OptML are comparing it to the libraries listed below
Sorting:
- Tenstorrent MLIR compiler☆218Updated last week
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆148Updated this week
- The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their per…☆53Updated this week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆115Updated last month
- IREE's PyTorch Frontend, based on Torch Dynamo.☆102Updated this week
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.☆47Updated this week
- ☆162Updated this week
- NVIDIA tools guide☆150Updated 11 months ago
- ☆27Updated 9 months ago
- Tenstorrent Kernel Module☆56Updated 2 weeks ago
- TPP experimentation on MLIR for linear algebra☆140Updated last week
- a simple end to end example of taking a ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu]☆36Updated 4 years ago
- Nvidia Instruction Set Specification Generator☆304Updated last year
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆138Updated 11 months ago
- GPUOcelot: A dynamic compilation framework for PTX☆219Updated 10 months ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆123Updated last month
- IREE plugin repository for the AMD AIE accelerator☆115Updated this week
- Conversions to MLIR EmitC☆134Updated last year
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆182Updated 5 months ago
- ☆85Updated this week
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆546Updated this week
- ☆84Updated last month
- MLIR Sample dialect☆132Updated 10 months ago
- IREE compiler and runtime for Snitch☆14Updated 2 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆145Updated last week
- A parser for PTX 6.5☆12Updated 2 years ago
- Fork of LLVM to support AMD AIEngine processors☆176Updated this week
- An interactive web-based tool for exploring intermediate representations of PyTorch and Triton models☆50Updated 2 weeks ago
- Super fast FP32 matrix multiplication on RDNA3☆81Updated 8 months ago
- Tutorial on building a gpu compiler backend in LLVM☆49Updated 11 months ago