TUE-EE-ES / HalideAutoGPU
☆11Updated 4 years ago
Alternatives and similar repositories for HalideAutoGPU
Users who are interested in HalideAutoGPU are comparing it to the libraries listed below.
- A Halide-language-to-MLIR compiler.☆26Updated 3 years ago
- CNNs in Halide☆23Updated 9 years ago
- Evaluating different memory managers for dynamic GPU memory☆25Updated 4 years ago
- HeteroCL-MLIR dialect for accelerator design☆41Updated 8 months ago
- Polyhedral High-Level Synthesis in MLIR☆31Updated 2 years ago
- Polyhedral Compilation tool for High Level Synthesis.☆10Updated 11 years ago
- ☆29Updated 2 years ago
- A domain-specific language and compiler for image processing☆76Updated 4 years ago
- HeteroHalide: From Image Processing DSL to Efficient FPGA Acceleration☆15Updated 4 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆52Updated 2 months ago
- A translator from C to MLIR☆28Updated 3 years ago
- ☆35Updated 3 years ago
- Public Release of Stream-Dataflow☆14Updated 6 years ago
- Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"☆28Updated 3 years ago
- TAPA is a dataflow HLS framework that features fast compilation, expressive programming model and generates high-frequency FPGA accelerat…☆19Updated 9 months ago
- HLS branch of Halide☆76Updated 6 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19Updated last year
- compiling DSLs to high-level hardware instructions☆22Updated 2 years ago
- ☆21Updated 3 years ago
- ☆17Updated 3 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated last year
- Sparse-dense matrix-matrix multiplication on GPUs☆14Updated 6 years ago
- ☆14Updated 5 years ago
- GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators☆31Updated 3 years ago
- Benchmark PyTorch Custom Operators☆14Updated last year
- An MLIR-based toy DL compiler for TVM Relay.☆58Updated 2 years ago
- CUDAAdvisor: a GPU profiling tool☆49Updated 6 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆28Updated 4 years ago
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network, C++ implementation (unofficial)☆17Updated 6 years ago
- A lightweight, Pythonic, frontend for MLIR☆81Updated last year