UoB-HPC / ipu-hpc-cookbook
Useful tutorials and recipes for developers doing low-level work with the Graphcore IPU
☆21Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ipu-hpc-cookbook
- Poplar libraries☆116Updated last year
- Kernel Tuner☆286Updated this week
- Reference implementations of MLPerf™ HPC training benchmarks☆41Updated 5 months ago
- TensorFlow for the IPU☆77Updated last year
- High-performance, GPU-aware communication library☆84Updated 2 weeks ago
- RCCL Performance Benchmark Tests☆48Updated 2 weeks ago
- HPCG benchmark based on ROCm platform☆35Updated last week
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆65Updated last year
- ROCm Communication Collectives Library (RCCL)☆267Updated this week
- Benchmark for measuring the performance of sparse and irregular memory access.☆75Updated 2 weeks ago
- ☆41Updated 4 years ago
- Training material for IPU users: tutorials, feature examples, simple applications☆87Updated last year
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆100Updated last year
- NCCL Examples from Official NVIDIA NCCL Developer Guide.☆13Updated 6 years ago
- ROC_SHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆39Updated last year
- Experimental projects related to TensorRT☆77Updated this week
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆127Updated 4 years ago
- A hierarchical collective communications library with portable optimizations☆21Updated 4 months ago
- PyTorch interface for the IPU☆176Updated last year
- Graph algorithms for machine learning frameworks☆27Updated last year
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆60Updated 6 years ago
- ☆17Updated 9 months ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆57Updated 4 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆44Updated 3 weeks ago
- STREAM, for lots of devices written in many programming models☆325Updated 2 months ago
- Chai☆42Updated 11 months ago
- Next generation SPARSE implementation for ROCm platform☆116Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆42Updated 3 weeks ago
- Poplar Advanced Runtime for the IPU☆6Updated 9 months ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆99Updated this week