uwplse / dexter
a compiler for re-writing image processing functions in C++ to Halide
☆23Updated 2 years ago
Alternatives and similar repositories for dexter:
Users that are interested in dexter are comparing it to the libraries listed below
- A Halide journey taken for pleasure, this repo will hopefully serve a collection of Halide imaging functions that are useful to the commu…☆15Updated 9 years ago
- GPUVerify: a Verifier for GPU Kernels☆59Updated 2 years ago
- IMPORTANT NOTICE: This implementation is long outdated. The new libwfv will be released soon. Whole-Function Vectorization is an algorith…☆22Updated 12 years ago
- A domain-specific language and compiler for image processing☆76Updated 3 years ago
- a Halide language To MLIR compiler.☆26Updated 3 years ago
- A framework that helps implementing swizzle GPU kernels☆42Updated 5 years ago
- AnyDSL traversal code☆15Updated 6 years ago
- Data Dependence Analyzer in the Polyhedral Model☆19Updated last year
- RV: A Unified Region Vectorizer for LLVM☆107Updated last month
- tokenizer and parser for circle projects☆11Updated 5 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Case Studies for Halide performance against C++ and OpenCL☆37Updated 11 years ago
- Program Generator for Small-Scale Linear Algebra Applications☆29Updated 6 years ago
- CNNs in Halide☆23Updated 9 years ago
- Skeletonide is a parallel implementation of Zhang-Suen morphological thinning algorithm written in Halide-lang. Use it for fast skeletoni…☆13Updated 4 years ago
- ☆102Updated 5 years ago
- ☆26Updated 6 years ago
- A C-family AST implementation designed to be an IR for DSL compilers.☆16Updated 7 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last year
- portDNN is a library implementing neural network algorithms written using SYCL☆111Updated 9 months ago
- A thin, highly portable toolkit for efficiently compiling dense loop-based computation.☆148Updated 2 years ago
- c++ posit implementation☆44Updated last year
- Library to plot integer sets and maps☆49Updated 8 years ago
- A fast and highly scalable GPU dynamic memory allocator☆104Updated 9 years ago
- OpenCL/SPIR-V implementation of HIP☆104Updated 2 years ago
- ☆11Updated 4 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆110Updated 2 years ago
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal - all it takes to sum a lot of numbers fast!☆88Updated last week
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- NeuroVectorizer is a framework that uses deep reinforcement learning (RL) to predict optimal vectorization compiler pragmas for for loops…☆92Updated 2 years ago