suco-gt / HPC-InternshipsLinks
Supercomputing @ GT has compiled a list of organizations that offer internships and experiences in HPC and applications of HPC.
☆65Updated last year
Alternatives and similar repositories for HPC-Internships
Users that are interested in HPC-Internships are comparing it to the libraries listed below
Sorting:
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆268Updated this week
- NVIDIA tools guide☆133Updated 5 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆56Updated last month
- A Parallel Code Evaluation Benchmark☆29Updated 2 weeks ago
- A curated list of awesome high performance computing resources☆902Updated last month
- CSC Summer School in High-Performance Computing☆107Updated this week
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆131Updated 5 years ago
- N-Ways to Multi-GPU Programming☆25Updated 2 years ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆206Updated 3 weeks ago
- Step-by-step optimization of CUDA SGEMM☆333Updated 3 years ago
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.☆148Updated 3 years ago
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications☆206Updated 3 years ago
- NVIDIA curated collection of educational resources related to general purpose GPU programming.☆460Updated last week
- Fast CUDA matrix multiplication from scratch☆730Updated last year
- Solution of Programming Massively Parallel Processors☆47Updated last year
- Rodinia benchmark☆17Updated 11 months ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆86Updated last week
- collection of benchmarks to measure basic GPU capabilities☆377Updated 3 months ago
- Some CUDA projects and utility☆29Updated 5 years ago
- CUDA Matrix Multiplication Optimization☆189Updated 10 months ago
- COCCL: Compression and precision co-aware collective communication library☆22Updated 2 months ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆78Updated last month
- IMPACT GPU Algorithms Teaching Labs☆57Updated 2 years ago
- ☆123Updated this week
- Example Makefile for CUDA and C++ source files in a standard project layout.☆48Updated 7 years ago
- Serial and parallel implementations of matrix multiplication☆41Updated 4 years ago
- 📚 A curated list of awesome matrix-matrix multiplication (A * B = C) frameworks, libraries and software☆35Updated 3 months ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA☆40Updated 6 years ago
- ☆9Updated last year
- Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)☆775Updated 9 months ago