argonne-lcf / AIaccelerators-SC23-tutorial
AI Accelerators-SC23-tutorial Repository
☆11 Updated last year
Related projects
Alternatives and complementary repositories for AIaccelerators-SC23-tutorial
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/… ☆19 Updated last year
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware ☆16 Updated 2 years ago
- Heterogeneous Accelerated Computed Cluster (HACC) Resources Page ☆19 Updated this week
- ☆25 Updated last month
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar… ☆70 Updated 2 years ago
- ☆15 Updated 3 years ago
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM. ☆23 Updated this week
- TAPA is a dataflow HLS framework that features fast compilation, expressive programming model and generates high-frequency FPGA accelerat… ☆19 Updated 2 months ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators ☆81 Updated last month
- ☆37 Updated this week
- Code base for OOPSLA'24 paper: UniSparse: An Intermediate Language for General Sparse Format Customization ☆28 Updated last week
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions. ☆45 Updated last month
- Multiple 1-stencil implementations using NVIDIA CUDA. ☆13 Updated 6 years ago
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated… ☆37 Updated 3 weeks ago
- Heterogeneous simulator for DECADES Project ☆29 Updated 5 months ago
- ☆41 Updated 4 years ago
- HeteroCL-MLIR dialect for accelerator design ☆40 Updated 2 months ago
- ETHZ Heterogeneous Accelerated Compute Cluster. ☆29 Updated last month
- ☆47 Updated 5 years ago
- A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures. ☆16 Updated 2 years ago
- ☆17 Updated 2 years ago
- C++/MPI proxies for distributed training of deep neural networks. ☆12 Updated 2 years ago
- Benchmark for measuring the performance of sparse and irregular memory access. ☆75 Updated this week
- Hands-on experience programming AI Engines using Vitis Unified Software Platform ☆37 Updated 3 months ago
- CUDA Flux is a profiler for GPU applications which reports the basic block execution frequencies of compute kernels ☆31 Updated 3 years ago
- The Splash-3 benchmark suite ☆42 Updated last year
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆27 Updated 2 months ago
- Data-Centric MLIR dialect ☆38 Updated last year
- GPTPU for SC 2021 ☆48 Updated last year
- EQueue Dialect ☆39 Updated 2 years ago