ChrisCummins / cldrive
πββοΈ Run arbitrary OpenCL kernels
β9Updated last year
Alternatives and similar repositories for cldrive:
Users that are interested in cldrive are comparing it to the libraries listed below
- Deep learning program generatorβ104Updated last year
- Kernel Tuning Toolkitβ55Updated 2 months ago
- DSL for stencils and image processingβ14Updated 8 years ago
- π "Synthesizing Benchmarks for Predictive Modeling" (π₯ CGO'17 Best Paper)β22Updated last year
- This repository contains my experiments with compression-related algorithmsβ35Updated 8 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.comβ38Updated last year
- maxas Scott Grey's maxas assembler sgemm explaining the (for me) missing parts https://github.com/NervanaSystems/maxasβ13Updated 6 years ago
- A framework that helps implementing swizzle GPU kernelsβ41Updated 4 years ago
- Chunky Loop Analyzer: A Polyhedral Representation Extraction Tool for High Level Programsβ23Updated 2 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernelsβ18Updated 9 years ago
- XLA integration of Open Neural Network Exchange (ONNX)β19Updated 6 years ago
- a Halide language To MLIR compiler.β26Updated 3 years ago
- Chunky Loop Interactionβ23Updated 5 years ago
- β11Updated 3 years ago
- Experiments and prototypes associated with IREE or MLIRβ51Updated 5 months ago
- A model checker based on SAT solving and inductionβ13Updated 9 years ago
- A GPU cache model for research purposesβ26Updated 11 years ago
- A tracing JIT compiler for PyTorchβ12Updated 3 years ago
- NeuroVectorizer is a framework that uses deep reinforcement learning (RL) to predict optimal vectorization compiler pragmas for for loopsβ¦β91Updated 2 years ago
- Tensor Compute Primitives: Mid-level Intermediate Representation for Machine Learning Programsβ36Updated last month
- a tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Testerβ34Updated last year
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sourcesβ108Updated last year
- The Insieme Compiler and Runtime Infrastructureβ33Updated 5 years ago
- A repository to test dialects defined dynamically.β12Updated last year
- GPUVerify: a Verifier for GPU Kernelsβ59Updated 2 years ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)β120Updated 2 years ago
- Data Dependence Analyzer in the Polyhedral Modelβ19Updated last year
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It proviβ¦β66Updated 11 months ago
- An MLIR frontend for tensor expressionsβ24Updated 4 years ago
- A fast and highly scalable GPU dynamic memory allocatorβ103Updated 9 years ago