ColfaxResearch / HOW-Series-LabsLinks
Practical exercises for HOW Series "Deep Dive", a Web-based training on parallel programming and performance optimization
☆34Updated 6 years ago
Alternatives and similar repositories for HOW-Series-Labs
Users that are interested in HOW-Series-Labs are comparing it to the libraries listed below
Sorting:
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆108Updated 2 years ago
- BLAS-like Library Instantiation Software Framework☆150Updated 3 weeks ago
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 3 years ago
- LaTeX Examples Document Source☆248Updated 8 months ago
- The SHOC Benchmark Suite☆257Updated 3 years ago
- GPUOCelot: A dynamic compilation framework for PTX☆288Updated 2 years ago
- Information about many aspects of high-performance computing. Wiki content moved to ~/docs.☆301Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆246Updated this week
- Python interface for the LIKWID C API (https://github.com/RRZE-HPC/likwid)☆46Updated 2 months ago
- Tutorials for the usage of the Uni.lu HPC platform☆152Updated last month
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆152Updated this week
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆91Updated 9 years ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆88Updated last year
- The OpenDwarfs project provides a benchmark suite consisting of different computation/communication idioms, i.e., dwarfs, for state-of-ar…☆98Updated 6 years ago
- This is an archive of materials produced for an introductory class on CUDA programming at Stanford University in 2010☆223Updated 3 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆123Updated this week
- Flexible GPGPU instrumentation☆88Updated 5 years ago
- Livermore Big Artificial Neural Network Toolkit☆229Updated 4 months ago
- C++ HPC Tutorial materials☆55Updated last year
- Bridge to connect nGraph with TensorFlow☆52Updated 2 years ago
- Reference implementations of MLPerf™ HPC training benchmarks☆49Updated 7 months ago
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆455Updated last month
- Examples for HIP☆210Updated 9 months ago
- RCCL Performance Benchmark Tests☆76Updated last week
- tools to create performance and roofline plots from measured data☆59Updated 11 years ago
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A…☆278Updated 6 months ago
- Reference workloads for modern deep learning methods.☆73Updated 2 years ago
- Kernel Tuner☆361Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆162Updated this week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆106Updated 8 years ago