keryell / muSYCL
muSYCL, the SYCL musical!
☆12Updated 4 months ago
Alternatives and similar repositories for muSYCL:
Users that are interested in muSYCL are comparing it to the libraries listed below
- Generate versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.☆9Updated 3 weeks ago
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆115Updated 2 months ago
- A polyhedral compiler for hardware accelerators☆55Updated 5 months ago
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆16Updated 2 years ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆76Updated this week
- High-Performance Reproducible BLAS using posit arithmetic☆12Updated 2 years ago
- BLAS implementation for Intel FPGA☆76Updated 4 years ago
- ☆25Updated 2 years ago
- IREE plugin repository for the AMD AIE accelerator☆72Updated this week
- Accelerator simulation framework using nn_dataflow traces and energy, etc. post-processing☆7Updated 5 years ago
- Library to plot integer sets and maps☆48Updated 8 years ago
- ☆27Updated 5 years ago
- Fast matrix multiplication for few-bit integer matrices on CPUs.☆27Updated 5 years ago
- A SYCL-specific LLVM-to-MLIR converter☆1Updated last year
- Heterogeneous Accelerated Computed Cluster (HACC) Resources Page☆20Updated last month
- A Language for Closed-form High-level ARchitecture Modeling☆19Updated 4 years ago
- Open Source Compiler Framework using ONNX as Frontend and IR☆29Updated 2 years ago
- Xilinx Modifications to Halide☆12Updated 3 years ago
- SAMO: Streaming Architecture Mapping Optimisation☆32Updated last year
- SYCL Benchmark Suite☆60Updated 4 months ago
- Implementation of the SYCL specification.☆67Updated 7 months ago
- Chai☆42Updated last year
- ColTraIn HBFP Training Emulator☆16Updated last year
- ☆20Updated 3 years ago
- Statistics on GPUs☆29Updated 4 months ago
- Header-only library of GPU-accelerated, concurrent data structures.☆10Updated last month
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆23Updated last month
- Chisel Project for Integrating RTL code into SDAccel☆17Updated 7 years ago
- HLS branch of Halide☆77Updated 6 years ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/…☆18Updated last year