argonne-lcf / AIaccelerators-SC23-tutorial
AI Accelerators-SC23-tutorial Repository
☆11 Updated 2 years ago
Alternatives and similar repositories for AIaccelerators-SC23-tutorial
Users who are interested in AIaccelerators-SC23-tutorial are comparing it to the repositories listed below.
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware ☆15 Updated 3 years ago
- Benchmark for measuring the performance of sparse and irregular memory access. ☆82 Updated 4 months ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/… ☆22 Updated 2 years ago
- ☆17 Updated 4 years ago
- Heterogeneous Accelerated Computed Cluster (HACC) Resources Page ☆22 Updated 3 months ago
- TAPA is a dataflow HLS framework that features fast compilation, expressive programming model and generates high-frequency FPGA accelerat… ☆19 Updated last year
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar… ☆77 Updated 3 years ago
- BLAS implementation for Intel FPGA ☆78 Updated 5 years ago
- ☆41 Updated 3 months ago
- Tutorial Material from the SST Team ☆25 Updated 5 months ago
- COCCL: Compression and precision co-aware collective communication library ☆29 Updated 9 months ago
- SST Macro Element Library ☆36 Updated 2 months ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators ☆98 Updated 6 months ago
- SST Architectural Simulation Components and Libraries ☆111 Updated 2 weeks ago
- ☆48 Updated 5 years ago
- ☆65 Updated last year
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services ☆187 Updated this week
- Artifact for "DX100: A Programmable Data Access Accelerator for Indirection (ISCA 2025)" paper ☆16 Updated 2 months ago
- Python Cache Hierarchy Simulator ☆101 Updated 5 months ago
- Chai ☆47 Updated last month
- Multiple 1-stencil implementations using nvidia cuda. ☆13 Updated 8 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆32 Updated 9 months ago
- ☆35 Updated this week
- ☆18 Updated last year
- The University of Bristol HPC Simulation Engine ☆104 Updated 4 months ago
- ☆20 Updated 6 years ago
- UniSparse: An Intermediate Language for General Sparse Format Customization (OOPSLA'24) ☆33 Updated last year
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM. ☆25 Updated last year
- Hands-on experience programming AI Engines using Vitis Unified Software Platform ☆40 Updated last year
- ☆31 Updated 3 years ago