suco-gt / HPC-Internships
Supercomputing @ GT has compiled a list of organizations that offer internships and experiences in HPC and applications of HPC.
☆75 Updated last month
Alternatives and similar repositories for HPC-Internships
Users who are interested in HPC-Internships are comparing it to the repositories listed below.
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial ☆305 Updated last month
- A curated list of awesome high performance computing resources ☆1,046 Updated last week
- NVIDIA tools guide ☆143 Updated 9 months ago
- Example Makefile for CUDA and C++ source files in a standard project layout. ☆48 Updated 7 years ago
- Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/) ☆876 Updated last year
- ☆192 Updated last year
- collection of benchmarks to measure basic GPU capabilities ☆429 Updated 8 months ago
- Rodinia benchmark ☆18 Updated last year
- grmonty: relativistic Monte Carlo code ☆48 Updated 11 months ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster ☆818 Updated 3 weeks ago
- Step-by-step optimization of CUDA SGEMM ☆388 Updated 3 years ago
- ☆20 Updated 4 months ago
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications ☆215 Updated 3 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems ☆133 Updated 5 years ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA ☆40 Updated 6 years ago
- Fast CUDA matrix multiplication from scratch ☆895 Updated last month
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform. ☆119 Updated this week
- Parallel Code Evaluation Benchmark ☆38 Updated 4 months ago
- Advanced Matrix Extensions (AMX) Guide ☆102 Updated 3 years ago
- NVIDIA curated collection of educational resources related to general purpose GPU programming. ☆747 Updated this week
- CUDA Learning guide ☆455 Updated last year
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading. ☆152 Updated 3 years ago
- Simple neural network implementation using CUDA technology. It is an educational implementation. ☆97 Updated 7 years ago
- IMPACT GPU Algorithms Teaching Labs ☆58 Updated 2 years ago
- CUDA Matrix Multiplication Optimization ☆228 Updated last year
- Kernel Tuner ☆368 Updated last week
- 📚 A curated list of awesome matrix-matrix multiplication (A * B = C) frameworks, libraries and software ☆54 Updated 7 months ago
- A set of hands-on tutorials for CUDA programming ☆240 Updated last year
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆209 Updated 5 months ago
- Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch ☆879 Updated 2 years ago