IBM / zDNN
IBM Z Deep Neural Network Library (zDNN) provides an interface for applications that use the Neural Network Processing Assist Facility (NNPA).
☆20 Updated 3 months ago
Alternatives and similar repositories for zDNN
Users who are interested in zDNN are comparing it to the libraries listed below.
- PMIx Reference RunTime Environment (PRRTE) ☆52 Updated last week
- The ultimate bandwidth benchmark ☆60 Updated 2 weeks ago
- Open source of an IBM Optimized version of the HPCG benchmark. ☆17 Updated 3 months ago
- ☆18 Updated last year
- Nanos6 is a runtime that implements the OmpSs-2 parallel programming model, developed by the System Tools and Advanced Runtimes (STAR) gr… ☆22 Updated 6 months ago
- Scripts to build AMD ROCm from source. ☆16 Updated last year
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs) ☆54 Updated last week
- A tracing infrastructure for heterogeneous computing applications. ☆39 Updated last week
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in… ☆25 Updated this week
- ☆10 Updated last week
- CPU and GPU tutorial examples ☆13 Updated 8 months ago
- Tutorials for Timemory ☆21 Updated last year
- Benchmarks ☆17 Updated 8 months ago
- MPI accelerator-integrated communication extensions ☆39 Updated 2 years ago
- ☆17 Updated 3 weeks ago
- HPCG benchmark based on ROCm platform ☆38 Updated 2 months ago
- COCCL: Compression and precision co-aware collective communication library ☆29 Updated 9 months ago
- ☆38 Updated last week
- ☆14 Updated 5 years ago
- A unified framework across multiple programming platforms ☆42 Updated 7 months ago
- PMIx Standard Document ☆26 Updated last month
- Codeplay project for contributions to the LLVM SYCL implementation ☆30 Updated 4 years ago
- oneAPI Data Parallel C++ (DPC++) language reference ☆26 Updated 3 years ago
- Data and reproducibility scripts for the UoB-HPC Performance Portability studies ☆18 Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆154 Updated 3 weeks ago
- ☆29 Updated 3 years ago
- Yaksa: High-performance Noncontiguous Data Management ☆14 Updated 2 months ago
- OpenMP offload playground ☆10 Updated last year
- OpenMP vs Offload ☆23 Updated 2 years ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆64 Updated 3 weeks ago