Intel® SHMEM - Device initiated shared memory based communication library
☆32Nov 12, 2025Updated 4 months ago
Alternatives and similar repositories for ishmem
Users that are interested in ishmem are comparing it to the libraries listed below
- ☆13Aug 28, 2025Updated 6 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆146Mar 10, 2026Updated last week
- ☆18Jan 17, 2024Updated 2 years ago
- Mini-applications that exclusively use the Kokkos programming model☆12Mar 21, 2023Updated 2 years ago
- Memory Topology for GPUs☆19Mar 4, 2026Updated 2 weeks ago
- A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management.…☆88Updated this week
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆28Oct 26, 2023Updated 2 years ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆60Updated this week
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆69Updated this week
- Performance-portable C++ code for simulating elastic shear waves in an axisymmetric domain.☆13Jan 30, 2022Updated 4 years ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆47Oct 12, 2021Updated 4 years ago
- ☆10Mar 12, 2026Updated last week
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming☆181Mar 13, 2026Updated last week
- iSNS server and client for Linux☆33Nov 5, 2024Updated last year
- PyNucleus is a finite element code that specifically targets nonlocal operators.☆14Feb 11, 2026Updated last month
- ☆14Mar 1, 2025Updated last year
- ☆48Mar 10, 2026Updated last week
- ☆38Jun 26, 2024Updated last year
- oneAPI Level Zero Conformance & Performance test content☆60Updated this week
- Python library to add support for embedding natural code in Python with shared program state.☆24Jan 20, 2026Updated last month
- NVIDIA Networking NIC Configuration Operator For Kubernetes☆15Updated this week
- Experimental Explicit Communications API for Kokkos☆33Mar 5, 2026Updated 2 weeks ago
- Distributed View Extension for Kokkos☆50Dec 2, 2024Updated last year
- ☆19Jan 21, 2026Updated last month
- some mixture of experts architecture implementations☆26Mar 22, 2024Updated last year
- oneAPI Collective Communications Library (oneCCL)☆257Feb 4, 2026Updated last month
- ☆46Dec 10, 2025Updated 3 months ago
- Spatial Transformer Network (STN) provides attention to a particular region in an image by applying a transformation to the input image. T…☆15Dec 21, 2020Updated 5 years ago
- Classifies percussion audio samples with a CNN-LSTM, written in Python and PyTorch. Also exports to Drumkv1 (lv2 plugin)☆14Aug 20, 2020Updated 5 years ago
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Jul 24, 2024Updated last year
- A C library for using the perf API on Linux☆18Apr 19, 2024Updated last year
- MPI accelerator-integrated communication extensions☆40Apr 4, 2023Updated 2 years ago
- pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support.☆84Updated this week
- a small lightweight std::execution work-alike☆66Mar 26, 2025Updated 11 months ago
- Managed collective communication service☆23Sep 2, 2024Updated last year
- Agent skills for vLLM☆47Mar 3, 2026Updated 2 weeks ago
- Resources and PDFs with an introduction to CUDA programming☆24May 7, 2018Updated 7 years ago
- ☆21Nov 19, 2021Updated 4 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆114May 21, 2024Updated last year