bcarlet / ptx-math
☆15 Updated 2 years ago
Alternatives and similar repositories for ptx-math
Users that are interested in ptx-math are comparing it to the libraries listed below
- Embedded Universal DSL: a good DSL for us, by us ☆37 Updated this week
- ☆29 Updated 2 years ago
- Bridging polyhedral analysis tools to the MLIR framework ☆111 Updated last year
- Conversions to MLIR EmitC ☆128 Updated 5 months ago
- An MLIR frontend for tensor expressions ☆25 Updated 4 years ago
- ☆35 Updated 3 years ago
- ☆57 Updated this week
- A framework that helps implementing swizzle GPU kernels ☆41 Updated 5 years ago
- CERE: Codelet Extractor and REplayer ☆40 Updated last year
- Tutorial for LLVM Dev Conference 2019. ☆15 Updated 5 years ago
- ☆52 Updated 5 years ago
- TPP experimentation on MLIR for linear algebra ☆131 Updated this week
- ☆55 Updated 6 years ago
- Data-Centric MLIR dialect ☆42 Updated last year
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti… ☆15 Updated last year
- IREE C++ Template ☆17 Updated 10 months ago
- development repository for the open earth compiler ☆80 Updated 4 years ago
- MLIR-based toolkit targeting intel heterogeneous hardware ☆44 Updated 3 months ago
- Fork of LLVM for demonstrating optimization pass development ☆30 Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆52 Updated 2 months ago
- Library to interface Compilers and ML models for ML-Enabled Compiler Optimizations ☆18 Updated 2 months ago
- MLIR metal dialect ☆27 Updated 8 months ago
- Unified compiler/runtime for interfacing with PyTorch Dynamo. ☆100 Updated 2 weeks ago
- CUDAAdvisor: a GPU profiling tool ☆49 Updated 6 years ago
- A lightweight, Pythonic, frontend for MLIR ☆81 Updated last year
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code ☆28 Updated 6 years ago
- Declarative MLIR compilers in Python! ☆35 Updated 4 years ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly. ☆119 Updated 2 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources ☆111 Updated last month
- Retargetable ML compilers for the twenty-first century! ☆13 Updated last month