UoB-HPC / ipu-hpc-cookbookLinks
Useful tutorials and recipes for developers doing low-level work with the Graphcore IPU
☆21Updated 3 years ago
Alternatives and similar repositories for ipu-hpc-cookbook
Users that are interested in ipu-hpc-cookbook are comparing it to the libraries listed below
Sorting:
- ROCm Communication Collectives Library (RCCL)☆349Updated this week
- ☆19Updated last year
- Kernel Tuner☆353Updated last week
- Poplar libraries☆119Updated last year
- Reference implementations of MLPerf™ HPC training benchmarks☆48Updated 4 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆343Updated this week
- Forked from https://bitbucket.org/berkeleylab/cs-roofline-toolkit/src/master/☆20Updated 6 years ago
- Advanced Profiling and Analytics for AMD Hardware☆159Updated this week
- Provides the examples to write and build Habana custom kernels using the HabanaTools☆22Updated 3 months ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆66Updated 6 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆72Updated 4 years ago
- STREAM, for lots of devices written in many programming models☆345Updated 10 months ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆280Updated last month
- RCCL Performance Benchmark Tests☆70Updated this week
- A Parallel Code Evaluation Benchmark☆34Updated last month
- Experimental projects related to TensorRT☆107Updated this week
- A library of GPU kernels for sparse matrix operations.☆270Updated 4 years ago
- Online CUDA Occupancy Calculator☆79Updated 3 years ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆62Updated last year
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆132Updated 5 years ago
- ❤️ CUDA/C++ GPU graph analytics simplified.☆31Updated 2 years ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆92Updated this week
- ☆10Updated 2 years ago
- oneAPI Collective Communications Library (oneCCL)☆238Updated last week
- High-performance, GPU-aware communication library☆86Updated 6 months ago
- Stores documents and resources used by the OpenXLA developer community☆126Updated 11 months ago
- Training material for Nsight developer tools☆161Updated 11 months ago
- Unified Collective Communication Library☆259Updated this week
- CUDA Kernel Benchmarking Library☆682Updated last week
- Assembler for NVIDIA Volta and Turing GPUs☆224Updated 3 years ago