☆17 · Sep 15, 2021 · Updated 4 years ago
Alternatives and similar repositories for stencilflow
Users that are interested in stencilflow are comparing it to the libraries listed below
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware ☆15 · Mar 1, 2022 · Updated 4 years ago
- development repository for the open earth compiler ☆82 · Feb 19, 2021 · Updated 5 years ago
- A Data-Centric Compiler for Machine Learning ☆85 · Dec 14, 2025 · Updated 3 months ago
- SQL Optimizations using MLIR ☆12 · Apr 5, 2020 · Updated 5 years ago
- Tutorial Material from the SST Team ☆25 · Aug 5, 2025 · Updated 7 months ago
- Data-Centric MLIR dialect ☆46 · Oct 16, 2023 · Updated 2 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga… ☆16 · Jan 3, 2022 · Updated 4 years ago
- RDMA-enabled Apache Kafka ☆21 · Jun 13, 2022 · Updated 3 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details ☆25 · Sep 27, 2019 · Updated 6 years ago
- StarPU Runtime system ☆16 · Sep 22, 2010 · Updated 15 years ago
- MLIR tools and dialect for GraphBLAS ☆18 · Mar 30, 2022 · Updated 3 years ago
- Algebraic multigrid benchmark ☆34 · Jul 9, 2024 · Updated last year
- High-level framework for stencil computations ☆40 · Apr 21, 2015 · Updated 10 years ago
- DaCe - Data Centric Parallel Programming ☆580 · Updated this week
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction ☆71 · Mar 17, 2025 · Updated last year
- Next generation CGRA generator ☆119 · Mar 13, 2026 · Updated last week
- This repo contains the source code of the project "FPGA implementation of BCIs using QCNNs" submitted to the Xilinx Open Hardware Design … ☆16 · Dec 14, 2021 · Updated 4 years ago
- A backend-dispatchable version of NumPy. ☆19 · Feb 27, 2021 · Updated 5 years ago
- ☆12 · Jul 9, 2021 · Updated 4 years ago
- BLAS implementation for Intel FPGA ☆78 · Nov 18, 2020 · Updated 5 years ago
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads ☆47 · Apr 7, 2021 · Updated 4 years ago
- Slides and material for Xilinx bootcamp ☆22 · Aug 6, 2021 · Updated 4 years ago
- Meta-Repository for Bespoke Silicon Group's Manycore Architecture (A.K.A HammerBlade) ☆44 · Jun 16, 2025 · Updated 9 months ago
- Distributed k-nearest Neighbors using Locality Sensitive Hashing and SYCL ☆10 · Jun 7, 2021 · Updated 4 years ago
- AI-ML-NLP Task Group ☆13 · Aug 10, 2023 · Updated 2 years ago
- A free frontend for the conformal bootstrap ☆22 · Jul 9, 2024 · Updated last year
- Simplified Interface to Complex Memory ☆28 · Aug 31, 2023 · Updated 2 years ago
- MoSAIC: Modular system for Acceleration Integration MoSAIC ☆10 · Aug 22, 2025 · Updated 6 months ago
- mantle library ☆44 · Dec 20, 2022 · Updated 3 years ago
- Example code for an MMIO plugin for Spike, the RISC-V ISA simulator. ☆12 · Aug 29, 2019 · Updated 6 years ago
- Interactive Pinout Generator ☆13 · Dec 28, 2025 · Updated 2 months ago
- Intermediate MPI lesson ☆27 · Apr 29, 2023 · Updated 2 years ago
- Loam system models ☆16 · Dec 30, 2019 · Updated 6 years ago
- Implementations of Buffets, which are efficient, composable idioms for implementing Explicit Decoupled Data Orchestration. ☆82 · Apr 30, 2019 · Updated 6 years ago
- Pytorch process group third-party plugin for UCC ☆21 · Apr 15, 2024 · Updated last year
- ☆18 · Mar 26, 2022 · Updated 3 years ago
- Nanos++ is a runtime designed to serve as runtime support in parallel environments. It is mainly used to support OmpSs, an extension to O… ☆38 · Aug 18, 2021 · Updated 4 years ago
- The Synapse Neuron Wallet - Binary Releases ☆15 · Dec 17, 2018 · Updated 7 years ago
- Learning Pytorch ☆13 · Jun 12, 2018 · Updated 7 years ago