ColfaxResearch / HOW-Series-LabsLinks
Practical exercises for HOW Series "Deep Dive", a Web-based training on parallel programming and performance optimization
☆33Updated 6 years ago
Alternatives and similar repositories for HOW-Series-Labs
Users that are interested in HOW-Series-Labs are comparing it to the libraries listed below
Sorting:
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆108Updated 2 years ago
- Compute applications.☆24Updated 5 years ago
- Python interface for the LIKWID C API (https://github.com/RRZE-HPC/likwid)☆46Updated 2 months ago
- Automatically exported from code.google.com/p/patus☆15Updated 9 years ago
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course.☆51Updated 5 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 3 years ago
- BLAS-like Library Instantiation Software Framework☆141Updated last week
- mirror from http://lotsofcores.com book 2, since dropbox isn't good for everyone☆38Updated 9 years ago
- ☆15Updated 9 years ago
- The OpenDwarfs project provides a benchmark suite consisting of different computation/communication idioms, i.e., dwarfs, for state-of-ar…☆99Updated 5 years ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆107Updated 11 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆121Updated this week
- C++ HPC Tutorial materials☆52Updated 11 months ago
- The SHOC Benchmark Suite☆256Updated 3 years ago
- SYCL Benchmark Suite☆65Updated this week
- A task benchmark☆43Updated 10 months ago
- Kernel Tuning Toolkit☆60Updated last month
- ☆20Updated 9 years ago
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 8 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆93Updated 3 months ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated last year
- ☆55Updated 2 years ago
- Examples for using SYCL on CUDA☆62Updated last week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆105Updated 7 years ago
- Resources to work offline on the assignments of Heterogenous Parallel Programming course from Coursera.☆72Updated 5 years ago
- MIOpenGEMM is now deprecated☆62Updated last year
- Information about many aspects of high-performance computing. Wiki content moved to ~/docs.☆291Updated last month
- Training materials provided by OpenACC.org.☆93Updated 10 months ago
- Tools and extensions for CUDA profiling☆65Updated 5 years ago