google / gematria
Machine learning for machine code.
☆84Updated this week
Alternatives and similar repositories for gematria:
Users that are interested in gematria are comparing it to the libraries listed below
- MLIR-based partitioning system☆76Updated this week
- RDNA3 emulator☆54Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆99Updated last month
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆56Updated last week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆40Updated 2 weeks ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆132Updated this week
- TORCH_LOGS parser for PT2☆36Updated this week
- ☆138Updated this week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆84Updated this week
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona…☆84Updated this week
- Intel® SHMEM - Device initiated shared memory based communication library☆23Updated this week
- A Top-Down Profiler for GPU Applications☆17Updated last year
- Bandwidth test for ROCm☆54Updated 2 weeks ago
- Experiments and prototypes associated with IREE or MLIR☆50Updated 7 months ago
- A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management.…☆54Updated this week
- TPP experimentation on MLIR for linear algebra☆122Updated 2 weeks ago
- ☆94Updated this week
- ☆63Updated last week
- End to End steps for adding custom ops in PyTorch.☆21Updated 4 years ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆74Updated this week
- Write a fast kernel and run it on Discord. See how you compare against the best!☆35Updated this week
- An experimental CPU backend for Triton☆103Updated this week
- Utilities for constructing a large dataset of LLVM IR☆18Updated 7 months ago
- A tracing JIT compiler for PyTorch☆13Updated 3 years ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆79Updated 3 weeks ago
- Benchmarks to capture important workloads.☆30Updated 2 months ago
- extensible collectives library in triton☆84Updated this week
- ☆57Updated 10 months ago
- An IR for efficiently simulating distributed ML computation.☆28Updated last year
- Conversions to MLIR EmitC☆128Updated 3 months ago