coderonion / awesome-mojo-max-mlirLinks
A collection of some awesome public MAX platform, Mojo programming language and Multi-Level IR Compiler Framework(MLIR) projects.
☆40Updated last year
Alternatives and similar repositories for awesome-mojo-max-mlir
Users that are interested in awesome-mojo-max-mlir are comparing it to the libraries listed below
Sorting:
- Tenstorrent MLIR compiler☆243Updated this week
- port of Andrjey Karpathy's llm.c to Mojo☆362Updated 5 months ago
- High-Performance FP32 GEMM on CUDA devices☆117Updated last year
- MLIR-based partitioning system☆162Updated this week
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆376Updated 9 months ago
- A Machine Learning framework from scratch in Pure Mojo 🔥☆441Updated last year
- Convert StableHLO models into Apple Core ML format☆21Updated last week
- Machine Learning algorithms in pure Mojo 🔥☆62Updated last week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆104Updated last month
- ☆88Updated 2 weeks ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆105Updated last week
- Python interface for MLIR - the Multi-Level Intermediate Representation☆273Updated last year
- Machine Learning library for the emerging Mojo/Python ecosystem☆316Updated last week
- C API for MLX☆170Updated 3 weeks ago
- GPUOcelot: A dynamic compilation framework for PTX☆219Updated 11 months ago
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆117Updated 2 months ago
- LLM training in simple, raw C/CUDA☆112Updated last year
- An experimental CPU backend for Triton☆173Updated 2 months ago
- ☆29Updated last year
- Backward compatible ML compute opset inspired by HLO/MHLO☆598Updated 2 weeks ago
- Learn GPU Programming in Mojo🔥 by Solving Puzzles☆284Updated this week
- Fast and Furious AMD Kernels☆346Updated last week
- The TT-Forge ONNX is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their p…☆53Updated this week
- A Python compiler design toolkit.☆477Updated this week
- Custom PTX Instruction Benchmark☆138Updated 11 months ago
- An interactive web-based tool for exploring intermediate representations of PyTorch and Triton models☆50Updated last week
- Repo for AI Compiler team. The intended purpose of this repo is for implementation of a PJRT device.☆51Updated this week
- Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.☆440Updated last month
- MLIR metal dialect☆36Updated last year
- TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels☆189Updated this week