williamfgc / simple-gemmLinks
Collection of simple General Matrix Multiplication - GEMM implementations
☆13Updated last year
Alternatives and similar repositories for simple-gemm
Users that are interested in simple-gemm are comparing it to the libraries listed below
Sorting:
- Proof of Concept: a C-callable GPU-enabled parallel 2-D heat diffusion solver written in Julia using CUDA, MPI and graphics☆24Updated 4 years ago
- A forwarding MPI implementation that can use any other MPI implementation via an MPI ABI☆48Updated 9 months ago
- HPC Examples and Documentation for Julia☆13Updated 2 years ago
- Flexible and performant GEMM kernels in Julia☆82Updated this week
- Julia implementation of LULESH with MPI + X.☆13Updated 2 years ago
- Information on how to set up Julia on HPC systems☆38Updated 2 years ago
- Julia bindings for NVTX, for instrumenting with the Nvidia Nsight Systems profiler☆35Updated 3 weeks ago
- A version of the STREAM benchmark which measures the sustainable memory bandwidth.☆27Updated 11 months ago
- Julia High Performance☆24Updated 6 years ago
- Development of SuiteSparse.jl, which ships as part of the Julia standard library.☆26Updated 2 years ago
- Julia package for hierarchical matrices☆28Updated 8 months ago
- Global Address SPace toolbox -- Julia wrapper☆10Updated 7 years ago
- Julia parallel constructs over MPI☆45Updated 2 months ago
- Parallel CPU and GPU high-performance computing - short course☆37Updated 4 years ago
- Automatic differentiation of FEniCS and Firedrake models in Julia☆13Updated 4 years ago
- Slides from the "Julia for HPC" minisymposium at JuliaCon 2022☆18Updated 2 years ago
- Julia HPC miniapp using parallel models (MPI.jl, CUDA.jl, AMDGPU.jl, ADIOS2.jl) and Jupyter/Pluto.jl notebooks☆22Updated 3 weeks ago
- ☆30Updated 2 years ago
- Inspecting GPUs with Julia☆45Updated last year
- Data validation and settings management in Julia☆13Updated 4 years ago
- Physics-Enhanced Regression for Initial Value Problems☆20Updated last year
- Programming Gemm Kernels on NVIDIA GPUs with Tensor Cores in Julia☆41Updated 2 months ago
- ☆19Updated 7 months ago
- Linnea is an experimental tool for the automatic generation of optimized code for linear algebra problems.☆70Updated 3 years ago
- Common types and interface for discretizers of ModelingToolkit PDESystems.☆13Updated last month
- GPU integrations for Dagger.jl☆53Updated 2 weeks ago
- Training materials for ModelingToolkit and JuliaSim☆38Updated 2 years ago
- Wrappers for the SciPy differential equation solvers for the SciML Scientific Machine Learning organization☆22Updated 2 months ago
- x86 Hardware Performance Counter monitoring in Julia☆20Updated 3 years ago
- ITT☆14Updated 8 months ago