uwplse / dexter
a compiler for re-writing image processing functions in C++ to Halide
☆23Updated 2 years ago
Alternatives and similar repositories for dexter:
Users that are interested in dexter are comparing it to the libraries listed below
- A Halide journey taken for pleasure, this repo will hopefully serve a collection of Halide imaging functions that are useful to the commu…☆15Updated 9 years ago
- A domain-specific language and compiler for image processing☆76Updated 4 years ago
- Program Generator for Small-Scale Linear Algebra Applications☆29Updated 6 years ago
- GPUVerify: a Verifier for GPU Kernels☆60Updated 2 years ago
- a Halide language To MLIR compiler.☆26Updated 3 years ago
- Data Dependence Analyzer in the Polyhedral Model☆20Updated last year
- A framework that helps implementing swizzle GPU kernels☆41Updated 5 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 4 years ago
- CNNs in Halide☆23Updated 9 years ago
- Skeletonide is a parallel implementation of Zhang-Suen morphological thinning algorithm written in Halide-lang. Use it for fast skeletoni…☆13Updated 4 years ago
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.☆22Updated 4 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- c++ posit implementation☆44Updated last year
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆70Updated 9 years ago
- ☆102Updated 5 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated 2 weeks ago
- OpenCL/SPIR-V implementation of HIP☆104Updated 2 years ago
- ☆11Updated 4 years ago
- tokenizer and parser for circle projects☆11Updated 5 years ago
- RV: A Unified Region Vectorizer for LLVM☆107Updated 2 months ago
- Case Studies for Halide performance against C++ and OpenCL☆37Updated 11 years ago
- A system for programming formally-verified loop transformations.☆16Updated 6 years ago
- ☆56Updated 2 weeks ago
- Reference implementation of the draft C++ GraphBLAS specification.☆30Updated last month
- Tensor Tiling Library☆34Updated last month
- Experimental ranges for CUDA☆24Updated 6 years ago
- ☆58Updated this week
- Automatic Differentiation for high-performance stencil loops☆12Updated 4 years ago
- ☆27Updated 6 years ago