haswelliris / CPC2018-GROMACSLinks
CPC2018第二届国产CPU并行应用挑战赛决赛
☆11Updated 7 years ago
Alternatives and similar repositories for CPC2018-GROMACS
Users that are interested in CPC2018-GROMACS are comparing it to the libraries listed below
Sorting:
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi☆108Updated last year
- Medusa: Building GPU-based Parallel Sparse Graph Applications with Sequential C/C++ Code☆63Updated 5 years ago
- A highly efficient library for GEMM operations on Sunway TaihuLight☆18Updated 5 years ago
- ☆95Updated 8 years ago
- Asynchronous Multi-GPU Programming Framework☆48Updated 4 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details☆24Updated 6 years ago
- PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems☆44Updated 4 months ago
- Graph500 reference implementations☆181Updated 3 years ago
- A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs☆55Updated 4 years ago
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)☆22Updated 5 years ago
- Intermediate MPI lesson☆27Updated 2 years ago
- openmp examples☆150Updated 6 years ago
- ParMETIS - Parallel Graph Partitioning and Fill-reducing Matrix Ordering☆167Updated 2 years ago
- A sparse BLAS lib supporting multiple backends☆49Updated last month
- Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)☆13Updated 5 years ago
- 为 Eijhout 教授的Introduction to HPC提供中文翻译、 PPT和Lab。☆328Updated 3 years ago
- ☆25Updated 4 years ago
- ☆14Updated 7 years ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆25Updated 6 months ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆84Updated last year
- Hybrid methods for Parallel Betweenness Centrality on the GPU☆24Updated 7 years ago
- A Deep Learning Framework customized for Sunway TaihuLight☆41Updated 6 years ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 3 years ago
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆75Updated 2 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆212Updated 3 weeks ago
- Classical molecular dynamics proxy application.☆32Updated 5 years ago
- Example code for Intel AVX / AVX2 intrinsics.☆143Updated 2 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆38Updated 6 years ago
- Parallel Tensor Infrastructure (ParTI!)☆33Updated 5 years ago
- CUSP : A C++ Templated Sparse Matrix Library☆420Updated 4 months ago