PAA-NCIC / PE
performance engineering
☆27Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for PE
- ☆82Updated 5 months ago
- Performance Prediction Toolkit for GPUs☆31Updated 2 years ago
- ☆11Updated 2 years ago
- ☆23Updated 4 months ago
- A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs☆11Updated 11 months ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆81Updated last year
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆102Updated 2 years ago
- ☆81Updated 4 months ago
- Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…☆38Updated 5 months ago
- ☆25Updated 4 years ago
- ☆9Updated 2 years ago
- Dissecting NVIDIA GPU Architecture☆82Updated 2 years ago
- A highly-flexible GPU simulator for AMD GPUs.☆92Updated 2 weeks ago
- Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]☆59Updated 2 years ago
- LLM Inference analyzer for different hardware platforms☆41Updated last week
- ☆32Updated last year
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆43Updated 5 months ago
- ☆16Updated 6 months ago
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.☆14Updated 4 months ago
- ☆27Updated 3 months ago
- Solution of Programming Massively Parallel Processors☆31Updated 9 months ago
- ☆24Updated 7 months ago
- ☆10Updated 9 months ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆27Updated 2 years ago
- ☆24Updated 6 months ago
- ☆44Updated 5 years ago
- A New Format for SIMD-accelerated SpMV☆19Updated 2 years ago
- Curated collection of papers in machine learning systems☆156Updated last month
- High performance Transformer implementation in C++.☆78Updated last month
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆9Updated 2 years ago