google / minimalloc
A lightweight memory allocator for hardware-accelerated machine learning
☆140Updated 5 months ago
Alternatives and similar repositories for minimalloc:
Users that are interested in minimalloc are comparing it to the libraries listed below
- ☆134Updated this week
- MLIR-based partitioning system☆58Updated this week
- TPP experimentation on MLIR for linear algebra☆115Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆129Updated this week
- Conversions to MLIR EmitC☆126Updated last month
- ☆90Updated this week
- An experimental CPU backend for Triton☆81Updated last week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆62Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆127Updated last year
- MLIR Sample dialect☆108Updated last week
- A language and compiler for irregular tensor programs.☆134Updated 2 months ago
- We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …☆175Updated this week
- A lightweight, Pythonic, frontend for MLIR☆80Updated last year
- Bridging polyhedral analysis tools to the MLIR framework☆107Updated last year
- A GPU-driven system framework for scalable AI applications☆111Updated last week
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆106Updated last year
- Experiments and prototypes associated with IREE or MLIR☆51Updated 5 months ago
- An out-of-tree MLIR dialect template.☆94Updated 4 months ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated last year
- Tenstorrent MLIR compiler☆86Updated this week
- Play with MLIR right in your browser☆131Updated last year
- An extension library of WMMA API (Tensor Core API)☆87Updated 6 months ago
- ☆88Updated this week
- ☆25Updated 11 months ago
- A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management.…☆49Updated this week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆77Updated this week
- LLVM OpenCL C compiler suite for ventus GPGPU☆40Updated last week
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆112Updated 3 weeks ago
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆56Updated 4 months ago
- Dissecting NVIDIA GPU Architecture☆84Updated 2 years ago