tensorflow / mlir-hlo
☆422 Updated this week
Alternatives and similar repositories for mlir-hlo
Users who are interested in mlir-hlo are comparing it to the libraries listed below.
- ☆192 Updated 2 years ago
- Backward compatible ML compute opset inspired by HLO/MHLO ☆579 Updated this week
- Shared Middle-Layer for Triton Compilation ☆318 Updated 2 weeks ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure ☆953 Updated last week
- Assembler for NVIDIA Volta and Turing GPUs ☆234 Updated 3 years ago
- A home for the final text of all TVM RFCs. ☆108 Updated last year
- Stores documents and resources used by the OpenXLA developer community ☆131 Updated last year
- A model compilation solution for various hardware ☆457 Updated 4 months ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels. ☆138 Updated 2 years ago
- ☆248 Updated 4 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆365 Updated last week
- MLIR-based partitioning system ☆151 Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆145 Updated last week
- Play with MLIR right in your browser ☆138 Updated 2 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning ☆731 Updated 2 years ago
- OpenAI Triton backend for Intel® GPUs ☆222 Updated this week
- heterogeneity-aware-lowering-and-optimization ☆257 Updated last year
- Development repository for the Triton-Linalg conversion ☆206 Updated 10 months ago
- TPP experimentation on MLIR for linear algebra ☆140 Updated last week
- An unofficial cuda assembler, for all generations of SASS, hopefully :) ☆560 Updated 2 years ago
- ☆162 Updated this week
- Experimental projects related to TensorRT ☆117 Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆499 Updated this week
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM ☆61 Updated 8 months ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description. ☆1,001 Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆256 Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo. ☆102 Updated this week
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections ☆122 Updated 3 years ago
- ☆145 Updated 10 months ago
- MLIR Sample dialect ☆132 Updated 10 months ago