iree-org / ireeLinks
A retargetable MLIR-based machine learning compiler and runtime toolkit.
☆3,241Updated last week
Alternatives and similar repositories for iree
Users that are interested in iree are comparing it to the libraries listed below
Sorting:
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,591Updated last week
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆3,383Updated this week
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆887Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,072Updated last week
- Backward compatible ML compute opset inspired by HLO/MHLO☆510Updated last week
- "Multi-Level Intermediate Representation" Compiler Infrastructure☆1,752Updated 4 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,415Updated this week
- A list of awesome compiler projects and papers for tensor computation and deep learning.☆2,617Updated 9 months ago
- Compiler for Neural Network hardware accelerators☆3,310Updated last year
- CUDA Core Compute Libraries☆1,805Updated last week
- MLIR For Beginners tutorial☆1,029Updated 2 weeks ago
- CUDA Templates for Linear Algebra Subroutines☆8,149Updated this week
- Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.☆709Updated this week
- A performant and modular runtime for TensorFlow☆758Updated 3 months ago
- Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.☆1,357Updated this week
- Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisati…☆1,579Updated 2 months ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆886Updated 7 months ago
- oneAPI Deep Neural Network Library (oneDNN)☆3,856Updated this week
- Tile primitives for speedy kernels☆2,541Updated this week
- ☆420Updated this week
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆992Updated 10 months ago
- SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R…☆2,461Updated last week
- common in-memory tensor structure☆1,042Updated last month
- Reference implementations of MLPerf™ inference benchmarks☆1,420Updated last week
- The Tensor Algebra SuperOptimizer for Deep Learning☆726Updated 2 years ago
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆611Updated this week
- Low-precision matrix multiplication☆1,812Updated last year
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,314Updated 3 months ago
- ☆1,902Updated 2 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,765Updated last year