Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward
☆25 · Apr 26, 2018 · Updated 7 years ago
Alternatives and similar repositories for df-nvshmem-prototype
Users who are interested in df-nvshmem-prototype are comparing it to the libraries listed below:
- Aries Network Performance Counters Monitoring Library ☆11 · Nov 19, 2020 · Updated 5 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA ☆11 · May 11, 2018 · Updated 7 years ago
- A Monte Carlo transport mini-app for studying new parallel algorithms ☆18 · Feb 13, 2026 · Updated 2 weeks ago
- Open Fabric Interfaces ☆16 · Jul 16, 2020 · Updated 5 years ago
- ☆17 · Sep 15, 2021 · Updated 4 years ago
- High Performance C++ Turbulent flow Lattice Boltzmann code ☆17 · Sep 19, 2019 · Updated 6 years ago
- ☆18 · Jan 17, 2024 · Updated 2 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of … ☆32 · Dec 21, 2024 · Updated last year
- MoSAIC: Modular system for Acceleration Integration MoSAIC ☆10 · Aug 22, 2025 · Updated 6 months ago
- Effective transpose on Hopper GPU ☆28 · Sep 6, 2025 · Updated 5 months ago
- Open-source Python package for a wide range of tasks in modeling cardiac electrophysiology using finite-difference methods. ☆12 · Updated this week
- Comb is a communication performance benchmarking tool. ☆26 · Feb 27, 2023 · Updated 3 years ago
- MPI accelerator-integrated communication extensions ☆39 · Apr 4, 2023 · Updated 2 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud. ☆122 · Nov 15, 2023 · Updated 2 years ago
- This repository contains an implementation for Portals4. Portals4 is a Network Programming Interface which allows high-performance networ… ☆14 · Sep 3, 2024 · Updated last year
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App ☆38 · Nov 12, 2024 · Updated last year
- robust geometric predicates ☆38 · Oct 26, 2020 · Updated 5 years ago
- Core OpenEP code - Matlab implementation ☆11 · Jan 25, 2026 · Updated last month
- GPU implementation of classical molecular dynamics proxy application. ☆31 · Jan 30, 2017 · Updated 9 years ago
- A tracing infrastructure for heterogeneous computing applications. ☆40 · Updated this week
- Pragmatic, Productive, and Portable Affinity for HPC ☆51 · Updated this week
- ☆11 · Feb 13, 2017 · Updated 9 years ago
- LITS: An Optimized Learned Index for Strings ☆13 · Jun 18, 2025 · Updated 8 months ago
- ☆15 · Dec 11, 2024 · Updated last year
- Finite-element library for analysis and adjoint-based gradient evaluation ☆11 · Feb 6, 2026 · Updated 3 weeks ago
- A simple script to plot the Roofline model for given HW platforms and applications ☆10 · Aug 22, 2024 · Updated last year
- Spark, Cassandra, Tessellation and ArcGIS ☆10 · Jan 18, 2015 · Updated 11 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses. ☆46 · Jul 15, 2019 · Updated 6 years ago
- Themis MapReduce and TritonSort ☆11 · Nov 2, 2017 · Updated 8 years ago
- Implementation of COO, CSR, CSC, SSS and TJDS sparse matrix formats. ☆11 · Jul 15, 2015 · Updated 10 years ago
- ☆11 · Feb 17, 2026 · Updated last week
- A dynamic GPU memory allocator, suitable for warp synchronized scenarios. ☆11 · Aug 20, 2019 · Updated 6 years ago
- ☆15 · Apr 6, 2016 · Updated 9 years ago
- NVMesh Container Storage Interface (CSI) Driver for Kubernetes ☆11 · Oct 7, 2024 · Updated last year
- Profile how CUDA applications create and modify data in memory. ☆14 · Mar 22, 2018 · Updated 7 years ago
- Machine Learning for Cardiac Electrical Imaging ☆10 · May 13, 2025 · Updated 9 months ago
- Fortran 2003 wrappers for POSIX threads ☆12 · Oct 13, 2017 · Updated 8 years ago
- ☆23 · Dec 30, 2025 · Updated 2 months ago
- A simple demo of Google Charts using Flask and Jinja2. ☆18 · Jun 26, 2011 · Updated 14 years ago