zohourih / Diffusion_FPGALinks
Highly-optimized spatially and temporally-blocked implementation of Diffusion 2D and 3D stencils for Intel FPGAs using OpenCL
☆13Updated last year
Alternatives and similar repositories for Diffusion_FPGA
Users that are interested in Diffusion_FPGA are comparing it to the libraries listed below
Sorting:
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆87Updated 2 years ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators☆97Updated 2 months ago
- SMASH is a hardware-software cooperative mechanism that enables highly-efficient indexing and storage of sparse matrices. The key idea of…☆16Updated 5 years ago
- Example for running IREE in a bare-metal Arm environment.☆40Updated last month
- Heterogeneous Research Platform (HERO) for exploration of heterogeneous computers consisting of programmable many-core accelerators and a…☆111Updated 2 years ago
- FPGA version of Rodinia in HLS C/C++☆40Updated 4 years ago
- Custom BLAS and LAPACK Cross-Compilation Framework for RISC-V☆19Updated 5 years ago
- Virtualized Accelerator Orchestration for Multi-Tenant Workloads☆18Updated 10 months ago
- [FCCM 2023] PASTA: Programming and Automation Support for Scalable Task-Parallel HLS Programs on Modern Multi-Die FPGAs☆12Updated 2 months ago
- TensorCore Vector Processor for Deep Learning - Google Summer of Code Project☆22Updated 4 years ago
- HW/SW co-design of sentence-level energy optimizations for latency-aware multi-task NLP inference☆52Updated last year
- ☆72Updated 11 months ago
- ☆102Updated last year
- The Riallto Open Source Project from AMD☆83Updated 5 months ago
- Heterogeneous Accelerated Computed Cluster (HACC) Resources Page☆22Updated this week
- ☆76Updated this week
- FlexGripPlus: an open-source GPU model for reliability evaluation and micro architectural simulation☆108Updated 2 years ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/…☆22Updated 2 years ago
- FPGA-based hardware acceleration for dropout-based Bayesian Neural Networks.☆26Updated 2 years ago
- ☆33Updated 2 years ago
- SST Macro Element Library☆37Updated 2 months ago
- ☆37Updated last year
- ETHZ Heterogeneous Accelerated Compute Cluster.☆37Updated 5 months ago
- ☆26Updated 4 years ago
- For CPU experiment☆12Updated 4 years ago
- Converting a deep neural network to integer-only inference in native C via uniform quantization and the fixed-point representation.☆25Updated 3 years ago
- FleetRec: Large-Scale Recommendation Inference on Hybrid GPU-FPGA Clusters☆17Updated 4 years ago
- ☆181Updated last month
- Hands-on experience using the Vitis unified software platform with Xilinx FPGA hardware☆48Updated last year
- XRM (Xilinx FPGA Resource Manager) Document:☆25Updated last year