google / gematria
Machine learning for machine code.
☆82Updated this week
Alternatives and similar repositories for gematria:
Users who are interested in gematria are comparing it to the libraries listed below.
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆56Updated 4 months ago
- An experimental CPU backend for Triton (https://github.com/openai/triton)☆37Updated 8 months ago
- MLIR-based partitioning system☆56Updated this week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆75Updated this week
- ☆131Updated this week
- ☆89Updated this week
- Experiments and prototypes associated with IREE or MLIR☆51Updated 5 months ago
- An IR for efficiently simulating distributed ML computation.☆25Updated last year
- An experimental CPU backend for Triton☆75Updated this week
- Stores documents and resources used by the OpenXLA developer community☆113Updated 5 months ago
- Utilities for constructing a large dataset of LLVM IR☆16Updated 5 months ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆59Updated this week
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆22Updated 3 months ago
- TORCH_LOGS parser for PT2☆30Updated last week
- End to End steps for adding custom ops in PyTorch.☆19Updated 4 years ago
- Development repository for the Triton language and compiler☆102Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆291Updated this week
- Random number library that generates pseudo-random and quasi-random numbers.☆25Updated this week
- Benchmarks to capture important workloads.☆29Updated this week
- GPUOcelot: A dynamic compilation framework for PTX☆157Updated 3 weeks ago
- ☆22Updated 3 weeks ago
- Bandwidth test for ROCm☆52Updated this week
- A Top-Down Profiler for GPU Applications☆14Updated 10 months ago
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona…☆73Updated this week
- ☆64Updated 2 months ago
- A GPU-driven system framework for scalable AI applications☆111Updated 3 months ago
- A tracing JIT for PyTorch☆17Updated 2 years ago
- A lightweight memory allocator for hardware-accelerated machine learning☆139Updated 5 months ago
- CUDA Templates for Linear Algebra Subroutines☆11Updated this week