☆17Sep 15, 2021Updated 4 years ago
Alternatives and similar repositories for stencilflow
Users that are interested in stencilflow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆15Mar 1, 2022Updated 4 years ago
- development repository for the open earth compiler☆82Feb 19, 2021Updated 5 years ago
- A Data-Centric Compiler for Machine Learning☆85Dec 14, 2025Updated 6 months ago
- SQL Optimizations using MLIR☆12Apr 5, 2020Updated 6 years ago
- Compiler toolchain to enable generation of high-level DSLs for geophysical fluid dynamics models☆29Mar 22, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Tutorial Material from the SST Team☆27Aug 5, 2025Updated 10 months ago
- Data-Centric MLIR dialect☆47Oct 16, 2023Updated 2 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆16Jan 3, 2022Updated 4 years ago
- Spiking Neural Network Accelerator☆15May 18, 2022Updated 4 years ago
- RDMA-enabled Apache Kafka☆21Jun 13, 2022Updated 4 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details☆25Sep 27, 2019Updated 6 years ago
- StarPU Runtime system☆16Sep 22, 2010Updated 15 years ago
- MLIR tools and dialect for GraphBLAS☆18Mar 30, 2022Updated 4 years ago
- DaCe - Data Centric Parallel Programming☆590Updated this week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆72Mar 17, 2025Updated last year
- Data structures for graph neural network☆19May 25, 2024Updated 2 years ago
- Next generation CGRA generator☆119May 26, 2026Updated last month
- This repo contains the source code of the project "FPGA implementation of BCIs using QCNNs" submitted to the Xilinx Open Hardware Design …☆18Dec 14, 2021Updated 4 years ago
- Rodinia Benchmark Suite for OpenCL-based FPGAs☆31Apr 11, 2023Updated 3 years ago
- Rich editor for SDFGs with included profiling and debugging, static analysis, and interactive optimization.☆22Dec 9, 2025Updated 6 months ago
- ☆13Jul 9, 2021Updated 4 years ago
- BLAS implementation for Intel FPGA☆78Nov 18, 2020Updated 5 years ago
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads☆48Apr 7, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Slides and material for Xilinx bootcamp☆22Aug 6, 2021Updated 4 years ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆24Apr 26, 2018Updated 8 years ago
- Meta-Repository for Bespoke Silicon Group's Manycore Architecture (A.K.A HammerBlade)☆45Jun 16, 2025Updated last year
- AI-ML-NLP Task Group☆13Aug 10, 2023Updated 2 years ago
- Simplified Interface to Complex Memory☆29Aug 31, 2023Updated 2 years ago
- MoSAIC: Modular system for Acceleration Integration MoSAIC☆10Jun 17, 2026Updated last week
- mantle library☆44Dec 20, 2022Updated 3 years ago
- Example code for an MMIO plugin for Spike, the RISC-V ISA simulator.☆12Aug 29, 2019Updated 6 years ago
- ☆27Mar 14, 2019Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Intermediate MPI lesson☆28Apr 29, 2023Updated 3 years ago
- Loam system models☆16Dec 30, 2019Updated 6 years ago
- LUMI software stack: LMOD-based module setup and EasyBuild setup.☆14Jun 17, 2026Updated last week
- Implementations of Buffets, which are efficient, composable idioms for implementing Explicit Decoupled Data Orchestration.☆82Apr 30, 2019Updated 7 years ago
- Pytorch process group third-party plugin for UCC☆22Apr 15, 2024Updated 2 years ago
- ☆18Mar 26, 2022Updated 4 years ago
- Nanos++ is a runtime designed to serve as runtime support in parallel environments. It is mainly used to support OmpSs, a extension to O…☆38Aug 18, 2021Updated 4 years ago