JacoCheung / hpccourse
ucas hpc course code
☆13Updated last year
Alternatives and similar repositories for hpccourse:
Users that are interested in hpccourse are comparing it to the libraries listed below
- Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…☆39Updated 10 months ago
- A sparse BLAS lib supporting multiple backends☆43Updated last month
- performance engineering☆30Updated 9 months ago
- ☆26Updated last year
- ☆29Updated 9 months ago
- ☆10Updated 2 years ago
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.☆16Updated 4 months ago
- UCAS 高性能计算系统 mpi☆12Updated 5 years ago
- An implementation of HPL-AI Mixed-Precision Benchmark based on hpl-2.3☆27Updated 3 years ago
- A highly efficient library for GEMM operations on Sunway TaihuLight☆17Updated 4 years ago
- ☆11Updated 2 years ago
- UCAS High Performance Computing System 国科大高性能计算系统复习及试题☆13Updated 2 years ago
- Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multipli…☆26Updated 10 months ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆27Updated 3 years ago
- Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)☆12Updated 5 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆85Updated 2 years ago
- A New Format for SIMD-accelerated SpMV☆21Updated 3 years ago
- 中国科学院大学 高性能计算系统2021春☆8Updated 3 years ago
- Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]☆63Updated 2 years ago
- Documentation for HPC course☆147Updated this week
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.☆29Updated 4 years ago
- ☆102Updated last week
- Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding☆14Updated 3 years ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA☆39Updated 6 years ago
- Domain-specific framework for performance analysis of parallel programs☆16Updated last month
- Fast GPU based tensor core reductions☆13Updated 2 years ago
- Solution of Programming Massively Parallel Processors☆43Updated last year
- PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems☆36Updated 4 months ago
- ☆50Updated 5 years ago
- Light-weight Performance Variance Detection for Production-run Parallel Applications☆13Updated last year