puneetar / Parallel-LU-Factorization-with-OpenMP-MPI
Problem: LU Factorization using OpenMP and MPI: study of scalability.
☆15 Updated 12 years ago
Alternatives and similar repositories for Parallel-LU-Factorization-with-OpenMP-MPI
Users who are interested in Parallel-LU-Factorization-with-OpenMP-MPI are comparing it to the libraries listed below.
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆115 Updated 2 years ago
- ulmBLAS ☆107 Updated 8 months ago
- MPI Tutorial Exercises ☆46 Updated 12 years ago
- Code repo for lotsofcores.com book 1, here since dropbox doesn't work for everyone ☆27 Updated 9 years ago
- A fast shared & distributed memory task-based runtime in C++ ☆28 Updated 4 years ago
- A light-weight MPI profiler. ☆105 Updated 4 months ago
- Information about many aspects of high-performance computing. Wiki content moved to ~/docs. ☆312 Updated last month
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL) ☆92 Updated 10 years ago
- MGARD: MultiGrid Adaptive Reduction of Data ☆46 Updated this week
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte… ☆25 Updated 8 months ago
- Some example MPI programs ☆101 Updated 14 years ago
- A task benchmark ☆44 Updated last year
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037 ☆47 Updated 2 years ago
- A dynamic analysis tool to detect floating-point errors in HPC applications. ☆39 Updated 3 weeks ago
- Instructions and templates for SC authors ☆17 Updated 4 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit ☆96 Updated 10 months ago
- DASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science ☆162 Updated 4 years ago
- Scalable GPU Kernel Fission/Fusion Transformation for Memory-Bound Kernels ☆14 Updated 10 years ago
- ytopt: machine-learning-based autotuning and hyperparameter optimization framework using Bayesian Optimization ☆49 Updated this week
- Introduction to CUDA programming ☆129 Updated 8 years ago
- Very-Low Overhead Checkpointing System ☆59 Updated 6 months ago
- mirror from http://lotsofcores.com book 2, since dropbox isn't good for everyone ☆37 Updated 9 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU" ☆31 Updated 5 years ago
- Integrated Performance Monitoring for High Performance Computing ☆91 Updated 4 years ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆95 Updated 4 years ago
- RAJA Performance Suite ☆130 Updated this week
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability… ☆103 Updated 3 months ago
- A unified framework across multiple programming platforms ☆43 Updated 8 months ago
- Error-bounded Lossy Data Compressor (for floating-point/integer datasets) ☆168 Updated 3 months ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆81 Updated 6 months ago