ut-osa / gpunet
GPUnet is a native GPU networking layer that provides a socket abstraction over InfiniBand to GPU programs for NVIDIA GPUs.
☆109 · Updated 9 years ago
Alternatives and similar repositories for gpunet:
Users who are interested in gpunet are comparing it to the libraries listed below:
- GPUfs - File system support for NVIDIA GPUs ☆93 · Updated 6 years ago
- ☆31 · Updated 7 years ago
- GPUDirect Async support for IB Verbs ☆112 · Updated 2 years ago
- OFI Programmer's Guide ☆52 · Updated 2 years ago
- ☆42 · Updated 7 years ago
- User-space Page Management ☆107 · Updated 8 months ago
- ☆32 · Updated 7 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code). ☆59 · Updated 11 years ago
- MapReduce for multi-core ☆49 · Updated 11 years ago
- A kernel module to enable RDMA transfers to/from IO, PFN and DAX mapped memory ☆10 · Updated 9 years ago
- Parallel Memory Bandwidth Measurement / Benchmark Tool ☆110 · Updated 2 years ago
- ☆72 · Updated 8 years ago
- Blaze runtime system that supports efficient accelerator integration for big data. ☆24 · Updated 8 years ago
- pytorch ucc plugin ☆21 · Updated 3 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆28 · Updated 7 months ago
- NVIDIA GPU direct RDMA using SISCI API ☆16 · Updated 7 years ago
- Distributed Shared Persistent Memory. SoCC 2017 ☆69 · Updated 4 years ago
- Collection of synchronization micro-benchmarks and traces from infrastructure applications ☆41 · Updated this week
- A tool for measuring the cache-coherence latencies of a processor (i.e., the latencies of loads, stores, CAS, FAI, TAS, and SWAP). ☆78 · Updated 3 years ago
- ☆67 · Updated 8 years ago
- Linux Cross-Memory Attach ☆93 · Updated 7 months ago
- an API and runtime environment for data processing with MapReduce for shared-memory multi-core & multiprocessor systems. ☆97 · Updated last year
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems. ☆33 · Updated 2 years ago
- Tensorflow is a computational library using data flow graphs for scalable machine learning, and Tensorflow-RDMA is the implementation ov… ☆58 · Updated 2 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources ☆111 · Updated 2 weeks ago
- A fast in-memory key-value store ☆49 · Updated 7 years ago
- Infiniband verbs performance tests (fork of git://git.openfabrics.org/~grockah/perftest.git) ☆18 · Updated 9 years ago
- Donard: A PCIe Peer-2-Peer kernel patch and library that builds on top of NVM Express. Also see https://github.com/sbates130272/linux-do… ☆31 · Updated 8 years ago
- DRAM Bank-Aware Kernel Memory Allocator ☆42 · Updated 3 months ago
- a multi-node fabric-attached memory manager that provides simple abstractions for accessing and allocating NVM from fabric-attached memor… ☆10 · Updated 11 months ago