PAA-NCIC / PE
performance engineering
☆27 Updated 4 months ago
Related projects
Alternatives and complementary repositories for PE
- ☆82 Updated this week
- Performance Prediction Toolkit for GPUs ☆31 Updated 2 years ago
- ☆30 Updated 4 months ago
- ☆32 Updated last year
- ☆24 Updated 7 months ago
- A highly-flexible GPU simulator for AMD GPUs. ☆93 Updated this week
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators ☆103 Updated 2 years ago
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24) ☆43 Updated 5 months ago
- ☆25 Updated 4 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche… ☆85 Updated last year
- ☆41 Updated 6 months ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores. ☆81 Updated last year
- ☆23 Updated 4 years ago
- Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y… ☆38 Updated 5 months ago
- ☆84 Updated 4 months ago
- ☆9 Updated 2 years ago
- ☆24 Updated 5 months ago
- ☆73 Updated last year
- ☆56 Updated 2 years ago
- ☆44 Updated 5 years ago
- Benchmark Framework for Buddy Projects ☆46 Updated 3 weeks ago
- OSDI 2023 Welder, deep learning compiler ☆16 Updated 11 months ago
- ☆23 Updated 2 years ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core. ☆27 Updated 2 years ago
- Compiler for Dynamic Neural Networks ☆43 Updated last year
- A New Format for SIMD-accelerated SpMV ☆19 Updated 2 years ago
- SC'22 Artifacts Evaluation ☆9 Updated 2 years ago
- UCAS HPC course code ☆13 Updated last year
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions. ☆45 Updated last month
- An Optimizing Compiler for Recommendation Model Inference ☆22 Updated 9 months ago