tsung-wei-huang / repo759
About repo with information useful for the Fall 2024 offering of ECE 759 - High Performance Computing for Applications in Engineering
☆25Updated 4 months ago
Alternatives and similar repositories for repo759:
Users that are interested in repo759 are comparing it to the libraries listed below
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆59Updated this week
- TileFlow is a performance analysis tool based on Timeloop for fusion dataflows☆58Updated 10 months ago
- Bridging polyhedral analysis tools to the MLIR framework☆108Updated last year
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆8Updated last week
- ☆90Updated this week
- A heterogeneous architecture timing model simulator.☆147Updated 2 months ago
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated…☆51Updated last week
- Heterogeneous Programming☆17Updated last year
- Triton to TVM transpiler.☆18Updated 4 months ago
- A highly-flexible GPU simulator for AMD GPUs.☆123Updated this week
- An MLIR-based toy DL compiler for TVM Relay.☆56Updated 2 years ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/…☆18Updated last year
- ☆13Updated 3 years ago
- ☆30Updated 2 years ago
- An open-source parameterizable NPU generator with full-stack multi-target compilation stack for intelligent workloads.☆46Updated last week
- Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support…☆43Updated 6 years ago
- ☆47Updated 5 years ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators☆86Updated 4 months ago
- ☆68Updated 4 years ago
- Hands-on experience programming AI Engines using Vitis Unified Software Platform☆39Updated 7 months ago
- Artifact evaluation of PLDI'24 paper "Allo: A Programming Model for Composable Accelerator Design"☆20Updated 10 months ago
- OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection(ICCAD 2024)☆15Updated 4 months ago
- Data-Centric MLIR dialect☆40Updated last year
- Polyhedral High-Level Synthesis in MLIR☆30Updated last year
- A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures.☆18Updated 2 years ago
- ☆33Updated last week
- LLVM OpenCL C compiler suite for ventus GPGPU☆41Updated 3 weeks ago
- [ASPLOS 2024] CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators☆28Updated 9 months ago